Tianle Cai

Cited by

	All	Since 2019
Citations	1957	1956
h-index	16	16
i10-index	17	17

860

430

215

645

20192020202120222023202421 64 175 475 843 377

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Di HePeking UniversityVerified email at pku.edu.cn
Liwei WangProfessor, Peking UniversityVerified email at cis.pku.edu.cn
Shengjie LuoPhD Student, Peking UniversityVerified email at stu.pku.edu.cn
Shuxin ZhengPrincipal Researcher, Microsoft ResearchVerified email at microsoft.com
Ruiqi GaoPhD Student, Princeton UniversityVerified email at princeton.edu
Jason D. LeeAssociate Professor of Electrical Engineering and Computer Science, Princeton UniversityVerified email at princeton.edu
Yuhong LiUniversity of Illinois at Urbana-ChampaignVerified email at illinois.edu
Haotian YeComputer Science Ph.D. at Stanford UniversityVerified email at stanford.edu
Runtian ZhaiPhD Student, Carnegie Mellon UniversityVerified email at cmu.edu
Denny ZhouResearch Scientist, Google DeepMindVerified email at google.com
Tengyu MAStanford UniversityVerified email at stanford.edu
Xuezhi WangResearch Scientist, Google DeepMindVerified email at google.com
Xinyun ChenGoogle DeepMindVerified email at berkeley.edu
Debadeepta DeyPrincipal Researcher, Microsoft Research | Azure AIVerified email at microsoft.com
Yi ZhangSenior Researcher at Microsoft Research RedmondVerified email at microsoft.com
Qi LeiAssistant Professor of Mathematics and Data Science, New York UniversityVerified email at nyu.edu
Tri DaoPrinceton University, Together AIVerified email at princeton.edu
Zexuan ZhongPrinceton UniversityVerified email at princeton.edu
Yikang ShenMIT-IBM Watson LabVerified email at ibm.com
Zengyi QinMassachusetts Institute of TechnologyVerified email at mit.edu

Tianle Cai

PhD Student, Princeton University

Verified email at princeton.edu - Homepage

Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Do Transformers Really Perform Badly for Graph Representation? C Ying, T Cai, S Luo, S Zheng, G Ke, D He, Y Shen, TY Liu NeurIPS 2021, arXiv preprint arXiv:2106.05234, 2021	908	2021
Adversarially robust generalization just requires more unlabeled data R Zhai, T Cai, D He, C Dan, K He, J Hopcroft, L Wang arXiv preprint arXiv:1906.00555, 2019	148	2019
Convergence of adversarial training in overparametrized neural networks R Gao, T Cai, H Li, CJ Hsieh, L Wang, JD Lee NeurIPS 2019 Spotlight, arXiv preprint arXiv:1906.07916, 13029-13040, 2019	135	2019
Graphnorm: A principled approach to accelerating graph neural network training T Cai, S Luo, K Xu, D He, T Liu, L Wang ICML 2021, arXiv preprint arXiv:2009.03294, 2020	134	2020
Towards a Theoretical Framework of Out-of-Distribution Generalization H Ye, C Xie, T Cai, R Li, Z Li, L Wang NeurIPS 2021, arXiv preprint arXiv:2106.04496, 2021	87	2021
Large language models as tool makers T Cai, X Wang, T Ma, X Chen, D Zhou ICLR 2024, arXiv preprint arXiv:2305.17126, 2023	85	2023
Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot J Su, Y Chen, T Cai, T Wu, R Gao, L Wang, JD Lee NeurIPS 2020, arXiv preprint arXiv:2009.11094, 2020	71	2020
Gram-Gauss-Newton Method: Learning Overparameterized Neural Networks for Regression Problems T Cai, R Gao, J Hou, S Chen, D Wang, D He, Z Zhang, L Wang NeurIPS 2019 Beyond First Order Methods in ML Workshop, arXiv preprint arXiv …, 2019	61	2019
Towards Certifying L-infinity Robustness using Neural Networks with L-inf-dist Neurons B Zhang, T Cai, Z Lu, D He, L Wang ICML 2021, arXiv preprint arXiv:2102.05363, 12368-12379, 2021	58*	2021
What Makes Convolutional Models Great on Long Sequence Modeling? Y Li, T Cai, Y Zhang, D Chen, D Dey ICLR 2023, arXiv preprint arXiv:2210.09298, 2022	54	2022
Locally differentially private (contextual) bandits learning K Zheng, T Cai, W Huang, Z Li, L Wang NeurIPS 2020, arXiv preprint arXiv:2006.00701, 2020	50	2020
A Theory of Label Propagation for Subpopulation Shift T Cai, R Gao, JD Lee, Q Lei ICML 2021, arXiv preprint arXiv:2102.11203, 2021	42	2021
Medusa: Simple llm inference acceleration framework with multiple decoding heads T Cai, Y Li, Z Geng, H Peng, JD Lee, D Chen, T Dao arXiv preprint arXiv:2401.10774, 2024	36*	2024
Stable, Fast and Accurate: Kernelized Attention with Relative Positional Encoding S Luo, S Li, T Cai, D He, D Peng, S Zheng, G Ke, L Wang, TY Liu NeurIPS 2021, arXiv preprint arXiv:2106.12566, 2021	36	2021
Defective Convolutional Networks T Luo, T Cai, M Zhang, S Chen, D He, L Wang arXiv preprint arXiv:1911.08432, 2019	20*	2019
Rest: Retrieval-based speculative decoding Z He, Z Zhong, T Cai, JD Lee, D He NAACL 2024, arXiv preprint arXiv:2311.08252, 2023	17	2023
Reward collapse in aligning large language models Z Song, T Cai, JD Lee, WJ Su arXiv preprint arXiv:2305.17608, 2023	13	2023
DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models M Li, T Cai, J Cao, Q Zhang, H Cai, J Bai, Y Jia, MY Liu, K Li, S Han CVPR 2024, arXiv preprint arXiv:2402.19481, 2024	1	2024
BitDelta: Your Fine-Tune May Only Be Worth One Bit J Liu, G Xiao, K Li, JD Lee, S Han, T Dao, T Cai arXiv preprint arXiv:2402.10193, 2024	1	2024
JetMoE: Reaching Llama2 Performance with 0.1 M Dollars Y Shen, Z Guo, T Cai, Z Qin arXiv preprint arXiv:2404.07413, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors