Rohan Anil

Cited by

	All	Since 2019
Citations	8700	8350
h-index	21	21
i10-index	25	25

3100

1550

775

2325

2017201820192020202120222023202498 223 526 770 993 1171 1863 3015

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Ehsan AmidSenior Research Scientist at Google DeepMindVerified email at google.com
Tomer KorenAssociate Professor at Tel Aviv UniversityVerified email at tauex.tau.ac.il
Vineet GuptaGoogle IncVerified email at google.com
George E. DahlGoogle Inc.Verified email at google.com
Christopher FiftyStanford UniversityVerified email at cornell.edu
Naman AgarwalSenior Research Scientist, Google AI PrincetonVerified email at google.com
Chelsea FinnStanford University, GoogleVerified email at cs.stanford.edu
Robert OrmandiGoogleVerified email at google.com
Alexandre PassosOpenAIVerified email at cs.umass.edu
Geoffrey HintonEmeritus Prof. Computer Science, University of TorontoVerified email at cs.toronto.edu
Cyril ZhangMicrosoft Research NYCVerified email at microsoft.com
Elad HazanProfessor at Princeton University and Director Google AI PrincetonVerified email at princeton.edu
Kunal TalwarApple IncVerified email at apple.com
Patrick NguyenResearch Scientist, Google, Inc.Verified email at google.com
Jonathan ShenGoogleVerified email at google.com
Mia Xu ChenGoogle BrainVerified email at google.com

Rohan Anil

Principal Engineer, Google Brain

Verified email at google.com

machine learning neural networks large scale training optimization algorithms


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Wide & deep learning for recommender systems HT Cheng, L Koc, J Harmsen, T Shaked, T Chandra, H Aradhye, ... Proceedings of the 1st workshop on deep learning for recommender systems, 7-10, 2016	3995	2016
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023	1056	2023
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1042	2023
Large scale distributed neural network training through online distillation R Anil, G Pereyra, AT Passos, R Ormandi, G Dahl, G Hinton Sixth International Conference on Learning Representations, 2018	481	2018
Knowledge distillation: A good teacher is patient and consistent L Beyer, X Zhai, A Royer, L Markeeva, R Anil, A Kolesnikov Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	262	2022
Gemma: Open models based on gemini research and technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	254	2024
Efficiently Identifying Task Groupings for Multi-Task Learning C Fifty, E Amid, Z Zhao, T Yu, R Anil, C Finn 2021 Conference on Neural Information Processing Systems, Spotlight, 2021	247	2021
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019	203	2019
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024	196	2024
Tf-ranking: Scalable tensorflow library for learning-to-rank RK Pasumarthi, S Bruch, X Wang, C Li, M Bendersky, M Najork, J Pfeifer, ... Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019	153	2019
Robust bi-tempered logistic loss based on bregman divergences E Amid, MK Warmuth, R Anil, T Koren 2019 Conference on Neural Information Processing Systems, 2019	130	2019
Gemini: A family of highly capable multimodal models R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805 1, 2023	109	2023
Large-Scale Differentially Private BERT R Anil, B Ghazi, V Gupta, R Kumar, P Manurangsi Privacy Preserving Machine Learning, 2021	108	2021
Scalable Second Order Optimization for Deep Learning R Anil, V Gupta, T Koren, K Regan, Y Singer arXiv preprint arXiv:2002.09018, 2020, 2020	108*	2020
Memory-efficient adaptive optimization for large-scale learning R Anil, V Gupta, T Koren, Y Singer 2019 Conference on Neural Information Processing Systems, 2019	62*	2019
Sunipa Dev R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... Jacob Devlin, Mark Díaz, Nan Du, Ethan Dyer, Vladimir Feinberg, Fangxiaoyu …, 2023	60	2023
Disentangling adaptive gradient methods from learning rates N Agarwal, R Anil, E Hazan, T Koren, C Zhang arXiv preprint arXiv:2002.11803, 2020	39	2020
A large batch optimizer reality check: Traditional, generic optimizers suffice across batch sizes Z Nado, JM Gilmer, CJ Shallue, R Anil, GE Dahl arXiv preprint arXiv:2102.06356, 2021	38	2021
Wide and deep machine learning models T Shaked, R Anil, HB Aradhye, G Anderson, W Chai, ML Koc, J Harmsen, ... US Patent 10,762,422, 2020	34	2020
On the factory floor: ML engineering for industrial-scale ads recommendation models R Anil, S Gadanho, D Huang, N Jacob, Z Li, D Lin, T Phillips, C Pop, ... arXiv preprint arXiv:2209.05310, 2022	24	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors