Banghua Zhu

Cited by

	All	Since 2019
Citations	1304	1302
h-index	16	16
i10-index	24	24

580

290

145

435

20192020202120222023202422 55 100 197 351 564

Public access

View all

16 articles

1 article

available

not available

Based on funding mandates

Co-authors

Jiantao JiaoAssistant Professor of EECS and Statistics, University of California, BerkeleyVerified email at berkeley.edu
Michael I. JordanProfessor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC BerkeleyVerified email at cs.berkeley.edu
Cong MaUniversity of ChicagoVerified email at uchicago.edu
Stuart RussellProfessor of Computer Science, University of California, BerkeleyVerified email at cs.berkeley.edu
Ying ShengPhD student of Stanford UniversityVerified email at stanford.edu
Lianmin ZhengUC BerkeleyVerified email at berkeley.edu
Ion StoicaProfessor of Computer Science, UC BerkeleyVerified email at cs.berkeley.edu
Joseph E. GonzalezProfessor of Computer Science, UC BerkeleyVerified email at berkeley.edu
Paria RashidinejadPostdoctoral Scholar, University of California, BerkeleyVerified email at berkeley.edu
Tianle LiUndergraduate Researcher, UC BerkeleyVerified email at berkeley.edu
Dacheng LiUC BerkeleyVerified email at berkeley.edu
Jacob SteinhardtStanford UniversityVerified email at cs.stanford.edu
Tianhao WuUniversity of California, BerkeleyVerified email at berkeley.edu
Evan FrickUC BerkeleyVerified email at berkeley.edu
Hanlin ZhuPh.D. student, University of California, BerkeleyVerified email at berkeley.edu
Kurt KeutzerProfessor of the Graduate School, EECS, University of California, BerkeleyVerified email at berkeley.edu
Song JianTsinghua UniversityVerified email at tsinghua.edu.cn
Shiyi CaoUC BerkeleyVerified email at berkeley.edu
Ikechukwu UchenduHarvard UniversityVerified email at g.harvard.edu
Lele WangUniversity of British ColumbiaVerified email at ece.ubc.ca

Banghua Zhu

University of California, Berkeley

Verified email at berkeley.edu - Homepage

foundation models human-AI interaction statistics information theory reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bridging offline reinforcement learning and imitation learning: A tale of pessimism P Rashidinejad, B Zhu, C Ma, J Jiao, S Russell Advances in Neural Information Processing Systems 34, 11702-11716, 2021	288	2021
Deconstructing Generative Adversarial Networks B Zhu, J Jiao, D Tse arXiv preprint arXiv:1901.09465, 2019	141*	2019
Principled reinforcement learning with human feedback from pairwise or k-wise comparisons B Zhu, M Jordan, J Jiao International Conference on Machine Learning, 43037-43067, 2023	117	2023
Joint transceiver optimization for wireless communication PHY using neural network B Zhu, J Wang, L He, J Song IEEE Journal on Selected Areas in Communications 37 (6), 1364-1373, 2019	105	2019
Jump-start reinforcement learning I Uchendu, T Xiao, Y Lu, B Zhu, M Yan, J Simon, M Bennice, C Fu, C Ma, ... International Conference on Machine Learning, 34556-34583, 2023	86	2023
Chatbot arena: An open platform for evaluating llms by human preference WL Chiang, L Zheng, Y Sheng, AN Angelopoulos, T Li, D Li, H Zhang, ... arXiv preprint arXiv:2403.04132, 2024	72	2024
Starling-7B: Improving LLM Helpfulness & Harmlessness with RLAIF B Zhu, E Frick, T Wu, H Zhu, J Jiao https://starling.cs.berkeley.edu/, 2023	48	2023
Generalized resilience and robust statistics B Zhu, J Jiao, J Steinhardt The Annals of Statistics 50 (4), 2256-2283, 2022	48	2022
Robust estimation via generalized quasi-gradients B Zhu, J Jiao, J Steinhardt Information and Inference: A Journal of the IMA 11 (2), 581-636, 2022	43	2022
The sample complexity of online contract design B Zhu, S Bates, Z Yang, Y Wang, J Jiao, MI Jordan arXiv preprint arXiv:2211.05732, 2022	40	2022
S-lora: Serving thousands of concurrent lora adapters Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ... arXiv preprint arXiv:2311.03285, 2023	39	2023
Byzantine-robust federated learning with optimal statistical rates B Zhu, L Wang, Q Pang, S Wang, J Jiao, D Song, MI Jordan International Conference on Artificial Intelligence and Statistics, 3151-3178, 2023	28*	2023
Sparse tensor decomposition for haplotype assembly of diploids and polyploids A Hashemi, B Zhu, H Vikalo BMC genomics 19, 1-15, 2018	27	2018
Fine-tuning language models with advantage-induced policy alignment B Zhu, H Sharma, FV Frujeri, S Dong, C Zhu, MI Jordan, J Jiao arXiv preprint arXiv:2306.02231, 2023	24	2023
Pairwise proximal policy optimization: Harnessing relative feedback for llm alignment T Wu, B Zhu, R Zhang, Z Wen, K Ramchandran, J Jiao arXiv preprint arXiv:2310.00212, 2023	18	2023
When does the Tukey median work? B Zhu, J Jiao, J Steinhardt 2020 IEEE International Symposium on Information Theory (ISIT), 1201-1206, 2020	18	2020
Online learning in stackelberg games with an omniscient follower G Zhao, B Zhu, J Jiao, M Jordan International Conference on Machine Learning, 42304-42316, 2023	16	2023
Minimax off-policy evaluation for multi-armed bandits C Ma, B Zhu, J Jiao, MJ Wainwright IEEE Transactions on Information Theory 68 (8), 5314-5339, 2022	13	2022
Fairness in serving large language models Y Sheng, S Cao, D Li, B Zhu, Z Li, D Zhuo, JE Gonzalez, I Stoica 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2024	11	2024
Noisy Sorting Capacity Z Wang, N Ghaddar, B Zhu, L Wang arXiv preprint arXiv:2202.01446, 2023	11	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors