‪Yihan Du‬ - ‪Google Scholar‬

Get my own profile

Cited by

	All	Since 2019
Citations	113	113
h-index	6	6
i10-index	5	5

0

38

19

202020212022202320248 24 25 38 18

Public access

5 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Longbo HuangProfessor, IIIS @ Tsinghua University, China, ACM Distinguished ScientistVerified email at tsinghua.edu.cn
Wei Chen （陈卫）Microsoft ResearchVerified email at microsoft.com
Haoyu ZhaoPrinceton UniversityVerified email at princeton.edu
Wen SunAssistant Professor, Cornell UniversityVerified email at cornell.edu
R. SrikantUniversity of Illinois at Urbana-ChampaignVerified email at illinois.edu

Yihan Du

Yihan Du

Postdoc, University of Illinois at Urbana-Champaign

Verified email at illinois.edu - Homepage

Online Learning Reinforcement Learning Representation Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Object-adaptive LSTM network for real-time visual tracking with adversarial data augmentation Y Du, Y Yan, S Chen, Y Hua Neurocomputing 384, 67-83, 2020	22	2020
Combinatorial pure exploration with full-bandit or partial linear feedback Y Du, Y Kuroki, W Chen Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021	21*	2021
Provably efficient risk-sensitive reinforcement learning: iterated CVaR and worst path Y Du, S Wang, L Huang International Conference on Learning Representations (ICLR), 2023	16	2023
Collaborative pure exploration in kernel bandit Y Du, W Chen, Y Yuroki, L Huang International Conference on Learning Representations (ICLR), 2023	14	2023
Combinatorial pure exploration for dueling bandit W Chen, Y Du, L Huang, H Zhao (*in alphabetical order) International Conference on Machine Learning (ICML), 1531-1541, 2020	12	2020
Object-adaptive LSTM network for visual tracking Y Du, Y Yan, S Chen, Y Hua, H Wang International Conference on Pattern Recognition (ICPR), 1719-1724, 2018	6	2018
A one-size-fits-all solution to conservative bandit problems Y Du, S Wang, L Huang Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) 35 (8 …, 2021	5	2021
Continuous mean-covariance bandits Y Du, S Wang, Z Fang, L Huang Advances in Neural Information Processing Systems (NeurIPS) 34, 875-886, 2021	4	2021
Provably safe reinforcement learning with step-wise violation constraints N Xiong, Y Du, L Huang Advances in Neural Information Processing Systems (NeurIPS) 36, 2024	3	2024
Combinatorial pure exploration with bottleneck reward function Y Du, Y Kuroki, W Chen Advances in Neural Information Processing Systems (NeurIPS) 34, 23956-23967, 2021	3	2021
Dueling bandits: from two-dueling to multi-dueling Y Du, S Wang, L Huang International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020	3	2020
Provably efficient iterated cvar reinforcement learning with function approximation Y Chen, Y Du, P Hu, S Wang, D Wu, L Huang International Conference on Learning Representations (ICLR), 2023	2	2023
Multi-task Representation Learning for Pure Exploration in Linear Bandits Y Du, L Huang, W Sun International Conference on Machine Learning (ICML), 2023	2	2023
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization Y Du, A Winnicki, G Dalal, S Mannor, R Srikant arXiv preprint arXiv:2402.10342, 2024		2024
Cascading Reinforcement Learning Y Du, R Srikant, W Chen International Conference on Learning Representations (ICLR, spotlight), 2024		2024
Branching reinforcement learning Y Du, W Chen International Conference on Machine Learning (ICML), 2022		2022

The system can't perform the operation now. Try again later.

Articles 1–16