Follow
Banghua Zhu
Title
Cited by
Cited by
Year
Bridging offline reinforcement learning and imitation learning: A tale of pessimism
P Rashidinejad, B Zhu, C Ma, J Jiao, S Russell
Advances in Neural Information Processing Systems 34, 11702-11716, 2021
2382021
Deconstructing Generative Adversarial Networks
B Zhu, J Jiao, D Tse
arXiv preprint arXiv:1901.09465, 2019
130*2019
Joint transceiver optimization for wireless communication PHY using neural network
B Zhu, J Wang, L He, J Song
IEEE Journal on Selected Areas in Communications 37 (6), 1364-1373, 2019
972019
Jump-start reinforcement learning
I Uchendu, T Xiao, Y Lu, B Zhu, M Yan, J Simon, M Bennice, C Fu, C Ma, ...
International Conference on Machine Learning, 34556-34583, 2023
602023
Principled Reinforcement Learning with Human Feedback from Pairwise or -wise Comparisons
B Zhu, J Jiao, MI Jordan
arXiv preprint arXiv:2301.11270, 2023
522023
Generalized resilience and robust statistics
B Zhu, J Jiao, J Steinhardt
The Annals of Statistics 50 (4), 2256-2283, 2022
412022
Robust estimation via generalized quasi-gradients
B Zhu, J Jiao, J Steinhardt
Information and Inference: A Journal of the IMA 11 (2), 581-636, 2022
372022
Sparse tensor decomposition for haplotype assembly of diploids and polyploids
A Hashemi, B Zhu, H Vikalo
BMC genomics 19, 1-15, 2018
252018
Byzantine-robust federated learning with optimal statistical rates
B Zhu, L Wang, Q Pang, S Wang, J Jiao, D Song, MI Jordan
International Conference on Artificial Intelligence and Statistics, 3151-3178, 2023
19*2023
The sample complexity of online contract design
B Zhu, S Bates, Z Yang, Y Wang, J Jiao, MI Jordan
arXiv preprint arXiv:2211.05732, 2022
192022
When does the tukey median work?
B Zhu, J Jiao, J Steinhardt
2020 IEEE International Symposium on Information Theory (ISIT), 1201-1206, 2020
152020
S-lora: Serving thousands of concurrent lora adapters
Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ...
arXiv preprint arXiv:2311.03285, 2023
132023
Fine-Tuning Language Models with Advantage-Induced Policy Alignment
B Zhu, H Sharma, FV Frujeri, S Dong, C Zhu, MI Jordan, J Jiao
arXiv preprint arXiv:2306.02231, 2023
122023
Minimax off-policy evaluation for multi-armed bandits
C Ma, B Zhu, J Jiao, MJ Wainwright
IEEE Transactions on Information Theory 68 (8), 5314-5339, 2022
112022
Linear representation meta-reinforcement learning for instant adaptation
M Peng, B Zhu, J Jiao
arXiv preprint arXiv:2101.04750, 2021
92021
Online learning in stackelberg games with an omniscient follower
G Zhao, B Zhu, J Jiao, M Jordan
International Conference on Machine Learning, 42304-42316, 2023
82023
On Optimal Caching and Model Multiplexing for Large Model Inference
B Zhu, Y Sheng, L Zheng, C Barrett, MI Jordan, J Jiao
arXiv preprint arXiv:2306.02003, 2023
72023
Noisy Sorting Capacity
Z Wang, N Ghaddar, B Zhu, L Wang
arXiv preprint arXiv:2202.01446, 2023
72023
Pairwise proximal policy optimization: Harnessing relative feedback for llm alignment
T Wu, B Zhu, R Zhang, Z Wen, K Ramchandran, J Jiao
arXiv preprint arXiv:2310.00212, 2023
42023
Online learning in a creator economy
B Zhu, SP Karimireddy, J Jiao, MI Jordan
arXiv preprint arXiv:2305.11381, 2023
42023
The system can't perform the operation now. Try again later.
Articles 1–20