Chenjia Bai
The Institute of AI (TeleAI), China Telecom
Verified email at chinatelecom.cn
Title
Cited by
Year
Exploration in Deep Reinforcement Learning: From Single-Agent to Multi-Agent Domain
J Hao, T Yang, H Tang, C Bai, J Liu, Z Meng, P Liu, Z Wang
IEEE Transactions on Neural Networks and Learning Systems, 2023
Cited by: 210*, Year: 2023
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
C Bai, L Wang, Z Yang, Z Deng, A Garg, P Liu, Z Wang
International Conference on Learning Representations (ICLR), 2022
Cited by: 134, Year: 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
R Yang, C Bai, X Ma, Z Wang, C Zhang, L Han
Neural Information Processing Systems (NeurIPS), 2022
Cited by: 64, Year: 2022
Survey on Sparse Reward in Deep Reinforcement Learning
W Yang, C Bai, C Cai, Y Zhao, P Liu
Computer Science (计算机科学) 47 (3), 182-191, 2020
Cited by: 46*, Year: 2020
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
H He, C Bai, K Xu, Z Yang, W Zhang, D Wang, B Zhao, X Li
Neural Information Processing Systems (NeurIPS), 2023
Cited by: 44, Year: 2023
Principled Exploration via Optimistic Bootstrapping and Backward Induction
C Bai, L Wang, L Han, J Hao, A Garg, P Liu, Z Wang
International Conference on Machine Learning (ICML), 2021
Cited by: 43, Year: 2021
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
S Qiu, L Wang, C Bai, Z Yang, Z Wang
International Conference on Machine Learning (ICML), 18168-18210, 2022
Cited by: 33, Year: 2022
Dynamic Bottleneck for Robust Self-Supervised Exploration
C Bai, L Wang, L Han, A Garg, J Hao, P Liu, Z Wang
Neural Information Processing Systems (NeurIPS), 2021
Cited by: 25, Year: 2021
Guided Goal Generation for Hindsight Multi-Goal Reinforcement Learning
C Bai, P Liu, W Zhao, X Tang
Neurocomputing 359, 353-367, 2019
Cited by: 25, Year: 2019
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
C Bai, P Liu, K Liu, L Wang, Y Zhao, L Han, Z Wang
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Cited by: 17, Year: 2021
False Correlation Reduction for Offline Reinforcement Learning
Z Deng, Z Fu, L Wang, Z Yang, C Bai, T Zhou, Z Wang, J Jiang
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
Cited by: 15*, Year: 2023
Addressing Hindsight Bias in Multi-Goal Reinforcement Learning
C Bai, L Wang, Y Wang, Z Wang, R Zhao, C Bai, P Liu
IEEE Transactions on Cybernetics, 2021
Cited by: 15, Year: 2021
Active Sampling for Deep Q-learning Based on TD-error Adaptive Correction
C Bai, P Liu, W Zhao, X Tang
Journal of Computer Research and Development (计算机研究与发展) 56 (2), 262-280, 2019
Cited by: 12*, Year: 2019
Behavior Contrastive Learning for Unsupervised Skill Discovery
R Yang, C Bai, H Guo, S Li, B Zhao, Z Wang, P Liu, X Li
International Conference on Machine Learning (ICML), 2023
Cited by: 10, Year: 2023
Generating Attentive Goals for Prioritized Hindsight Reinforcement Learning
P Liu, C Bai, Y Zhao, C Bai, W Zhao, X Tang
Knowledge-Based Systems 203, 106140, 2020
Cited by: 10, Year: 2020
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
K Xu, C Bai, X Ma, D Wang, B Zhao, Z Wang, X Li, W Li
Neural Information Processing Systems (NeurIPS), 2023
Cited by: 8, Year: 2023
Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning
C Bai, T Xiao, Z Zhu, L Wang, F Zhou, A Garg, B He, P Liu, Z Wang
IEEE Transactions on Neural Networks and Learning Systems, 2022
Cited by: 8, Year: 2022
Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration
Y Zhang, S Yang, C Bai, F Wu, X Li, X Li, Z Wang
arXiv preprint arXiv:2405.14314, 2024
Cited by: 7, Year: 2024
Large-Scale Actionless Video Pre-training via Discrete Diffusion for Efficient Policy Learning
H He, C Bai, L Pan, W Zhang, B Zhao, X Li
arXiv preprint arXiv:2402.14407, 2024
Cited by: 7, Year: 2024
Robust Quadrupedal Locomotion via Risk-Averse Policy Learning
J Shi, C Bai, H He, L Han, D Wang, B Zhao, X Li, X Li
IEEE International Conference on Robotics and Automation (ICRA), 2024
Cited by: 7, Year: 2024