Object-adaptive LSTM network for real-time visual tracking with adversarial data augmentation Y Du, Y Yan, S Chen, Y Hua Neurocomputing 384, 67-83, 2020 | 22 | 2020 |
Combinatorial pure exploration with full-bandit or partial linear feedback Y Du, Y Kuroki, W Chen Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021 | 21* | 2021 |
Provably efficient risk-sensitive reinforcement learning: iterated CVaR and worst path Y Du, S Wang, L Huang International Conference on Learning Representations (ICLR), 2023 | 16 | 2023 |
Collaborative pure exploration in kernel bandit Y Du, W Chen, Y Kuroki, L Huang International Conference on Learning Representations (ICLR), 2023 | 14 | 2023 |
Combinatorial pure exploration for dueling bandit W Chen, Y Du, L Huang, H Zhao (*in alphabetical order) International Conference on Machine Learning (ICML), 1531-1541, 2020 | 12 | 2020 |
Object-adaptive LSTM network for visual tracking Y Du, Y Yan, S Chen, Y Hua, H Wang International Conference on Pattern Recognition (ICPR), 1719-1724, 2018 | 6 | 2018 |
A one-size-fits-all solution to conservative bandit problems Y Du, S Wang, L Huang Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021 | 5 | 2021 |
Continuous mean-covariance bandits Y Du, S Wang, Z Fang, L Huang Advances in Neural Information Processing Systems (NeurIPS) 34, 875-886, 2021 | 4 | 2021 |
Provably safe reinforcement learning with step-wise violation constraints N Xiong, Y Du, L Huang Advances in Neural Information Processing Systems (NeurIPS) 36, 2024 | 3 | 2024 |
Combinatorial pure exploration with bottleneck reward function Y Du, Y Kuroki, W Chen Advances in Neural Information Processing Systems (NeurIPS) 34, 23956-23967, 2021 | 3 | 2021 |
Dueling bandits: from two-dueling to multi-dueling Y Du, S Wang, L Huang International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020 | 3 | 2020 |
Provably efficient iterated CVaR reinforcement learning with function approximation Y Chen, Y Du, P Hu, S Wang, D Wu, L Huang International Conference on Learning Representations (ICLR), 2023 | 2 | 2023 |
Multi-task Representation Learning for Pure Exploration in Linear Bandits Y Du, L Huang, W Sun International Conference on Machine Learning (ICML), 2023 | 2 | 2023 |
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization Y Du, A Winnicki, G Dalal, S Mannor, R Srikant arXiv preprint arXiv:2402.10342, 2024 | | 2024 |
Cascading Reinforcement Learning Y Du, R Srikant, W Chen International Conference on Learning Representations (ICLR, spotlight), 2024 | | 2024 |
Branching reinforcement learning Y Du, W Chen International Conference on Machine Learning (ICML), 2022 | | 2022 |