Zeyu Zheng

Cited by

	All	Since 2019
Citations	2048	1992
h-index	9	9
i10-index	9	9

Citations per year
Year	Citations
2017	12
2018	44
2019	91
2020	136
2021	192
2022	167
2023	211
2024	1190

Public access (based on funding mandates)

5 articles available
0 articles not available

Co-authors

Satinder Singh: Google DeepMind / U. of Michigan (verified email at umich.edu)
Junhyuk Oh: Research Scientist, DeepMind (verified email at google.com)
Will Dabney: DeepMind (verified email at google.com)
Hado van Hasselt: Research Scientist, DeepMind; Honorary Professor, UCL (verified email at google.com)
Eric Xing: President at Mohamed bin Zayed University of AI, Professor of Computer Science, Carnegie Mellon U (verified email at cs.cmu.edu)
Hao Zhang: UC San Diego (verified email at ucsd.edu)
Razvan Pascanu: Google DeepMind (verified email at google.com)
Clare Lyle: Google DeepMind (verified email at deepmind.com)
Michal Valko: Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind (verified email at meta.com)
Rémi Munos: Google DeepMind (verified email at inria.fr)
Zhaohan Daniel Guo: DeepMind (verified email at google.com)
Yunhao Tang: Research Scientist, DeepMind (verified email at columbia.edu)
Daniele Calandriello: Research Scientist, DeepMind (verified email at google.com)
Wenfei Fan: Professor of Web Data Management, University of Edinburgh (verified email at inf.ed.ac.uk)
Richard L. Lewis: Professor of Psychology, Linguistics and Cognitive Science, University of Michigan (verified email at umich.edu)
Zhongwen Xu: Tencent (verified email at tencent.com)
David Silver: DeepMind, UCL (verified email at google.com)
Matteo Hessel: Research Engineer, Google DeepMind (verified email at google.com)
Haozhu Wang: AWS AI (verified email at amazon.com)
Chengang Ji: PhD, University of Michigan-Ann Arbor (verified email at umich.edu)

Zeyu Zheng

DeepMind

Verified email at deepmind.com

artificial intelligence, machine learning, reinforcement learning, deep learning


Title	Cited by	Year
Gemini: a family of highly capable multimodal models. G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1042	2023
Poseidon: An efficient communication architecture for distributed deep learning on GPU clusters. H Zhang, Z Zheng, S Xu, W Dai, Q Ho, X Liang, Z Hu, J Wei, P Xie, ... 2017 USENIX Annual Technical Conference (USENIX ATC 17), 181-193, 2017	414	2017
On learning intrinsic rewards for policy gradient methods. Z Zheng, J Oh, S Singh. Advances in Neural Information Processing Systems, 4644-4654, 2018	202	2018
Parallelizing sequential graph computations. W Fan, J Xu, Y Wu, W Yu, J Jiang, Z Zheng, B Zhang, Y Cao, C Tian. Proceedings of the 2017 ACM International Conference on Management of Data …, 2017	121	2017
What Can Learned Intrinsic Rewards Capture? Z Zheng, J Oh, M Hessel, Z Xu, M Kroiss, H Van Hasselt, D Silver, S Singh. International Conference on Machine Learning, 11436-11446, 2020	90	2020
Automated multi-layer optical design via deep reinforcement learning. H Wang, Z Zheng, C Ji, LJ Guo. Machine Learning: Science and Technology 2 (2), 025013, 2021	61	2021
Understanding plasticity in neural networks. C Lyle, Z Zheng, E Nikishin, BA Pires, R Pascanu, W Dabney. International Conference on Machine Learning, 23190-23211, 2023	48	2023
Generalized Preference Optimization: A Unified Approach to Offline Alignment. Y Tang, ZD Guo, Z Zheng, D Calandriello, R Munos, M Rowland, ... arXiv preprint arXiv:2402.05749, 2024	23	2024
Understanding the performance gap between online and offline alignment algorithms. Y Tang, DZ Guo, Z Zheng, D Calandriello, Y Cao, E Tarassov, R Munos, ... arXiv preprint arXiv:2405.08448, 2024	13	2024
Disentangling the Causes of Plasticity Loss in Neural Networks. C Lyle, Z Zheng, K Khetarpal, H van Hasselt, R Pascanu, J Martens, ... arXiv preprint arXiv:2402.18762, 2024	9	2024
Adaptive Pairwise Weights for Temporal Credit Assignment. Z Zheng, R Vuorio, R Lewis, S Singh. Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 9225-9232, 2022	7*	2022
Learning State Representations from Random Deep Action-conditional Predictions. Z Zheng, V Veeriah, R Vuorio, RL Lewis, S Singh. Advances in Neural Information Processing Systems 34, 23679-23691, 2021	6	2021
Towards multi‐agent reinforcement learning‐driven over‐the‐counter market simulations. N Vadori, L Ardon, S Ganesh, T Spooner, S Amrouni, J Vann, M Xu, ... Mathematical Finance 34 (2), 262-347, 2024	5	2024
GrASP: Gradient-Based Affordance Selection for Planning. V Veeriah, Z Zheng, R Lewis, S Singh. arXiv preprint arXiv:2202.04772, 2022	4	2022
Human Alignment of Large Language Models through Online Preference Optimisation. D Calandriello, D Guo, R Munos, M Rowland, Y Tang, BA Pires, ... arXiv preprint arXiv:2403.08635, 2024	3	2024
Normalization and effective learning rates in reinforcement learning. C Lyle, Z Zheng, K Khetarpal, J Martens, H van Hasselt, R Pascanu, ... arXiv preprint arXiv:2407.01800, 2024		2024
Advances in Deep Reinforcement Learning: Intrinsic Rewards, Temporal Credit Assignment, State Representations, and Value-equivalent Models. Z Zheng		2022
Reinforcement learning using meta-learned intrinsic rewards. Z Zheng, J Oh, SS Baveja. US Patent App. 17/033,410, 2021		2021
Towards Perpetually Trainable Neural Networks. C Lyle, Z Zheng, K Khetarpal, R Pascanu, J Martens, H van Hasselt, ...
Supplementary Material: On Learning Intrinsic Rewards for Policy Gradient Methods. Z Zheng, J Oh, S Singh
