Resnest: Split-attention networks. H Zhang, C Wu, Z Zhang, Y Zhu, Z Zhang, H Lin, Y Sun, T He, J Mueller, ... Conference on Computer Vision and Pattern (ECV), 2022 | 1176 | 2022 |
Deep Graph Library: Towards Efficient and Scalable Deep Learning on Graphs M Wang, L Yu, Q Gan, D Zheng, Y Gai, Z Ye, M Li, J Zhou, Q Huang, ... International Conference on Learning Representations, 2019 | 534 | 2019 |
Self-Driving Database Management Systems. A Pavlo, G Angulo, J Arulraj, H Lin, J Lin, L Ma, P Menon, TC Mowry, ... CIDR 4, 1, 2017 | 272 | 2017 |
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing J Guo, H He, T He, L Lausen, M Li, H Lin, X Shi, C Wang, J Xie, S Zha, ... Journal of Machine Learning Research, 2019 | 178 | 2019 |
Is Network the Bottleneck of Distributed Training? Z Zhang, C Chang, H Lin, Y Wang, R Arora, X Jin SIGCOMM NetAI, 2020 | 44 | 2020 |
Temporal-Contextual Recommendation in Real-Time Y Ma, BM Narayanaswamy, H Lin, H Ding KDD 2020, 2020 | 39 | 2020 |
Local AdaAlter: Communication-Efficient Stochastic Gradient Descent with Adaptive Learning Rates C Xie, O Koyejo, I Gupta, H Lin NeurIPS 2020, optimizations for machine learning, 2019 | 29 | 2019 |
CSER: Communication-efficient SGD with Error Reset C Xie, S Zheng, OO Koyejo, I Gupta, M Li, H Lin Advances in Neural Information Processing Systems 33, 2020 | 25 | 2020 |
Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources H Lin, H Zhang, Y Ma, T He, Z Zhang, S Zha, M Li arXiv preprint arXiv:1904.12043, 2019 | 18 | 2019 |
Accelerated Large Batch Optimization of BERT Pretraining in 54 minutes S Zheng, H Lin, S Zha, M Li arXiv preprint arXiv:2006.13484, 2020 | 16 | 2020 |
Compressed Communication for Distributed Training: Adaptive Methods and System Y Zhong, C Xie, S Zheng, H Lin arXiv preprint arXiv:2105.07829, 2021 | 5 | 2021 |
Deep graph library M Wang, L Yu, Q Gan, D Zheng, Y Gai, Z Ye, M Li, J Zhou, Q Huang, ... | 5 | 2018 |
Just-in-Time Dynamic-Batching S Zha, Z Jiang, H Lin, Z Zhang Conference on Neural Information Processing Systems, 2018 | 3 | 2018 |
dPRO: A Generic Performance Diagnosis and Optimization Toolkit for Expediting Distributed DNN Training H Hu, C Jiang, Y Zhong, Y Peng, C Wu, Y Zhu, H Lin, C Guo Proceedings of Machine Learning and Systems 4, 623-637, 2022 | 2 | 2022 |
Dive into Deep Learning for Natural Language Processing H Lin, X Shi, L Lausen, A Zhang, H He, S Zha, A Smola Proceedings of the 2019 Conference on Empirical Methods in Natural Language …, 2019 | 2 | 2019 |
Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies Z Wang, H Lin, Y Zhu, TSE Ng | 1 | 2023 |
SAPipe: Staleness-Aware Pipeline for Data Parallel DNN Training Y Chen, C Xie, M Ma, J Gu, Y Peng, H Lin, C Wu, Y Zhu Advances in Neural Information Processing Systems, 2022 | | 2022 |