LIIR: Learning individual intrinsic reward in multi-agent reinforcement learning Y Du, L Han, M Fang, T Dai, J Liu, D Tao Advances in Neural Information Processing Systems (NeurIPS) 32, 4403-4414, 2019 | 140 | 2019 |
Curriculum-guided hindsight experience replay M Fang, T Zhou, Y Du, L Han, Z Zhang Advances in Neural Information Processing Systems (NeurIPS), 2019 | 125 | 2019 |
A review of safe reinforcement learning: Methods, theory and applications S Gu, L Yang, Y Du, G Chen, F Walter, J Wang, Y Yang, A Knoll arXiv preprint arXiv:2205.10330, 2022 | 63 | 2022 |
Learning Correlated Communication Topology in Multi-Agent Reinforcement Learning Y Du, B Liu, V Moens, Z Liu, Z Ren, J Wang, X Chen, H Zhang AAMAS, 2021 | 45 | 2021 |
Enhancing the robustness of neural collaborative filtering systems under malicious attacks Y Du, M Fang, J Yi, C Xu, J Cheng, D Tao IEEE Transactions on Multimedia 21 (3), 555-565, 2018 | 43 | 2018 |
Learning in Nonzero-Sum Stochastic Games with Potentials D Mguni, Y Wu, Y Du, Y Yang, Z Wang, M Li, Y Wen, J Jennings, J Wang ICML, 2021 | 38 | 2021 |
Ordering-Based Causal Discovery with Reinforcement Learning X Wang, Y Du, S Zhu, L Ke, Z Chen, J Hao, J Wang IJCAI, 2021 | 36 | 2021 |
Grid-wise control for multi-agent reinforcement learning in video game ai Y Du*, L Han*, P Sun*, J Xiong, Q Wang, X Sun, H Liu, T Zhang International Conference on Machine Learning (ICML), 2576-2585, 2019 | 35* | 2019 |
Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games Y Xu*, M Fang*, L Chen, Y Du, JT Zhou, C Zhang Advances in Neural Information Processing Systems (NeurIPS) 33, 2020 | 34 | 2020 |
Reinforcement Learning with Multiple Relational Attention for Solving Vehicle Routing Problems Y Xu, M Fang, L Chen, G Xu, Y Du, C Zhang IEEE Transactions on Cybernetics (TCYB), 2021 | 28 | 2021 |
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL R Yang, Y Lu, W Li, H Sun, M Fang, Y Du, X Li, L Han, C Zhang ICLR, 2022 | 27 | 2022 |
Towards query efficient black-box attacks: An input-free perspective Y Du, M Fang, J Yi, J Cheng, D Tao Proceedings of the 11th ACM CCS@Workshop on Artificial Intelligence and …, 2018 | 22 | 2018 |
GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning J Ruan, Y Du, X Xiong, D Xing, X Li, L Meng, H Zhang, J Wang, B Xu AAMAS 2022, 2022 | 19 | 2022 |
Reliable facility systems design subject to edge failures: based on the uncapacitated fixed-charge location problem Y Pan, Y Du, Z Wei American Journal of Operations Research 2014, 2014 | 18 | 2014 |
Privileged matrix factorization for collaborative filtering Y Du, C Xu, D Tao IJCAI International Joint Conference on Artificial Intelligence, 2017 | 14 | 2017 |
MHER: Model-based hindsight experience replay R Yang, M Fang, L Han, Y Du, F Luo, X Li arXiv preprint arXiv:2107.00306, 2021 | 13 | 2021 |
Reinforcement Recommendation with User Multi-aspect Preference X Chen, Y Du, L Xia, J Wang The Web Conference (TheWebConf), 2021 | 13 | 2021 |
Generalization in Text-based Games via Hierarchical Reinforcement Learning Y Xu, M Fang, L Chen, Y Du, C Zhang Findings of EMNLP, 2021 | 10 | 2021 |
Matrix factorization for collaborative budget allocation Y Du, C Xu, D Tao IEEE Transactions on Automation Science and Engineering 15 (4), 1471-1482, 2018 | 10 | 2018 |
Diversity-augmented intrinsic motivation for deep reinforcement learning T Dai, Y Du, M Fang, AA Bharath Neurocomputing 468, 396-406, 2022 | 8 | 2022 |