Follow
Xing Liu
Xing Liu
Research Scientist, Meta Platforms, Inc.
Verified email at fb.com
Title
Cited by
Cited by
Year
Efficient sparse matrix-vector multiplication on x86-based many-core processors
X Liu, M Smelyanskiy, E Chow, P Dubey
Proceedings of the 27th international ACM conference on International …, 2013
3232013
FROSTT: The formidable repository of open sparse tensors and tools
S Smith, JW Choi, J Li, R Vuduc, J Park, X Liu, G Karypis
1422017
Algorithmic time, energy, and power on candidate HPC compute building blocks
J Choi, M Dukhan, X Liu, R Vuduc
2014 IEEE 28th international parallel and distributed processing symposium …, 2014
1002014
Efficient shared-memory implementation of high-performance conjugate gradient benchmark and its application to unstructured matrices
J Park, M Smelyanskiy, K Vaidyanathan, A Heinecke, DD Kalamkar, X Liu, ...
SC'14: Proceedings of the International Conference for High Performance …, 2014
642014
Software-hardware co-design for fast and scalable training of deep learning recommendation models
D Mudigere, Y Hao, J Huang, Z Jia, A Tulloch, S Sridharan, X Liu, ...
Proceedings of the 49th Annual International Symposium on Computer …, 2022
602022
Tt-rec: Tensor train compression for deep learning recommendation models
C Yin, B Acun, CJ Wu, X Liu
Proceedings of Machine Learning and Systems 3, 448-462, 2021
602021
Truss decomposition on shared-memory parallel systems
S Smith, X Liu, NK Ahmed, AS Tom, F Petrini, G Karypis
2017 IEEE High Performance Extreme Computing Conference (HPEC), 1-6, 2017
582017
Optimizing sparse matrix-vector multiplication for large-scale data analytics
D Buono, F Petrini, F Checconi, X Liu, X Que, C Long, TC Tuan
Proceedings of the 2016 International Conference on Supercomputing, 1-12, 2016
552016
High-performance, distributed training of large-scale deep learning recommendation models
D Mudigere, Y Hao, J Huang, A Tulloch, S Sridharan, X Liu, M Ozdal, ...
arXiv preprint arXiv:2104.05158, 2021
432021
Parallel scalability of Hartree–Fock calculations
E Chow, X Liu, M Smelyanskiy, JR Hammond
The Journal of chemical physics 142 (10), 2015
362015
A new scalable parallel algorithm for Fock matrix construction
X Liu, A Patel, E Chow
2014 IEEE 28th international parallel and distributed processing symposium …, 2014
332014
Improving the performance of dynamical simulations via multiple right-hand sides
X Liu, E Chow, K Vaidyanathan, M Smelyanskiy
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
332012
Blocking Optimization Techniques for Sparse Tensor Computation
J Choi, X Liu, S Smith, T Simon
2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018
322018
On optimizing distributed tucker decomposition for dense tensors
VT Chakaravarthy, JW Choi, DJ Joseph, X Liu, P Murali, Y Sabharwal, ...
Parallel and Distributed Processing Symposium (IPDPS), 2017 IEEE …, 2017
322017
High-performance dense tucker decomposition on GPU clusters
J Choi, X Liu, V Chakaravarthy
SC18: International Conference for High Performance Computing, Networking …, 2018
292018
Scaling up Hartree–Fock Calculations on Tianhe-2
E Chow, X Liu, S Misra, M Dukhan, M Smelyanskiy, JR Hammond, Y Du, ...
International Journal of High Performance Computing Applications, 2015
272015
Genome sequences for five strains of the emerging pathogen Haemophilus haemolyticus
IK Jordan, AB Conley, IV Antonov, RA Arthur, ED Cook, GP Cooper, ...
Journal of Bacteriology 193 (20), 5879-5880, 2011
272011
Mahmoud khorashadi, Pallab Bhattacharya, Petr Lapukhov, Maxim Naumov, Ajit Mathews, Lin Qiao, Mikhail Smelyanskiy, Bill Jia, and Vijay Rao. 2021. Software-Hardware Co-design …
D Mudigere, Y Hao, J Huang, Z Jia, A Tulloch, S Sridharan, X Liu, ...
arXiv preprint arXiv:2104.05158, 2022
262022
A Sparse Direct Solver for Distributed Memory Xeon Phi-accelerated Systems
P Sao, X Liu, R Vuduc, X Li
Parallel and Distributed Processing Symposium, 2015 IEEE 29th International …, 2015
242015
Picture processing via a shared decoded picture pool
Y Yuan, R Yan, S Xu, X Liu, HD Li
US Patent 8,300,704, 2012
242012
The system can't perform the operation now. Try again later.
Articles 1–20