PANet: Few-shot image semantic segmentation with prototype alignment K Wang, JH Liew, Y Zou, D Zhou, J Feng Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 1271 | 2019 |
Understanding and Resolving Performance Degradation in Deep Graph Convolutional Networks K Zhou, Y Dong, K Wang, WS Lee, B Hooi, H Xu, J Feng Proceedings of the 30th ACM International Conference on Information …, 2021 | 138* | 2021 |
Improving generalization in reinforcement learning with mixture regularization K Wang, B Kang, J Shao, J Feng Advances in Neural Information Processing Systems 33, 7968-7978, 2020 | 121 | 2020 |
Efficient Value Iteration for s-rectangular Robust Markov Decision Processes N Kumar, K Wang, KY Levy, S Mannor Forty-first International Conference on Machine Learning, 2024 | 20* | 2024 |
Towards Better Laplacian Representation in Reinforcement Learning with Generalized Graph Drawing K Wang, K Zhou, Q Zhang, J Shao, B Hooi, J Feng International Conference on Machine Learning, 11003-11012, 2021 | 19 | 2021 |
Neural epitome search for architecture-agnostic network compression D Zhou, X Jin, Q Hou, K Wang, J Yang, J Feng arXiv preprint arXiv:1907.05642, 2019 | 18* | 2019 |
Revisiting Intrinsic Reward for Exploration in Procedurally Generated Environments K Wang, K Zhou, B Kang, J Feng, S Yan The Eleventh International Conference on Learning Representations, 2023 | 11* | 2023 |
Relational reasoning via set transformers: Provable efficiency and applications to MARL F Zhang, B Liu, K Wang, V Tan, Z Yang, Z Wang Advances in Neural Information Processing Systems 35, 35825-35838, 2022 | 9 | 2022 |
The geometry of robust value functions K Wang, N Kumar, K Zhou, B Hooi, J Feng, S Mannor International Conference on Machine Learning, 22727-22751, 2022 | 6 | 2022 |
Tyger: Task-Type-Generic Active Learning for Molecular Property Prediction K Zhou, K Wang, J Feng, J Tang, T Xu, X Wang arXiv preprint arXiv:2205.11279, 2022 | 2 | 2022 |
Jointly Modelling Uncertainty and Diversity for Active Molecular Property Prediction K Zhou, K Wang, J Tang, J Feng, B Hooi, P Zhao, T Xu, X Wang Learning on Graphs Conference, 29:1-29:21, 2022 | 1 | 2022 |
Policy Gradient for Reinforcement Learning with General Utilities N Kumar, K Wang, K Levy, S Mannor arXiv preprint arXiv:2210.00991, 2022 | 1 | 2022 |
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel U Gadot, K Wang, N Kumar, KY Levy, S Mannor Forty-first International Conference on Machine Learning, 2024 | 1* | 2024 |
Improving Token-Based World Models with Parallel Observation Prediction L Cohen, K Wang, B Kang, S Mannor arXiv preprint arXiv:2402.05643, 2024 | | 2024 |
C-Procgen: Empowering Procgen with Controllable Contexts Z Tan, K Wang, X Wang arXiv preprint arXiv:2311.07312, 2023 | | 2023 |
PPG reloaded: an empirical study on what matters in phasic policy gradient K Wang, D Zhou, J Feng, S Mannor | | 2023 |
Reachability-Aware Laplacian Representation in Reinforcement Learning K Wang, K Zhou, J Feng, B Hooi, X Wang arXiv preprint arXiv:2210.13153, 2022 | | 2022 |
Q-Learning for Lp Robust Markov Decision Processes N Kumar, K Wang, K Levy, S Mannor | | 2022 |
Learning the Uncertainty Set in Robust Markov Decision Process N Kumar, K Wang, U Gadot, KY Levy, S Mannor The Second Tiny Papers Track at ICLR 2024 | | 2024 |
Targeted Uncertainty Reduction in Robust MDPs U Gadot, K Wang, E Derman, N Kumar, K Levy, S Mannor NeurIPS 2023 Workshop on Generalization in Planning | | 2023 |