Follow
Kaixin Wang
Kaixin Wang
Microsoft Research
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
Panet: Few-shot image semantic segmentation with prototype alignment
K Wang, JH Liew, Y Zou, D Zhou, J Feng
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
12712019
Understanding and Resolving Performance Degradation in Deep Graph Convolutional Networks
K Zhou, Y Dong, K Wang, WS Lee, B Hooi, H Xu, J Feng
Proceedings of the 30th ACM International Conference on Information …, 2021
138*2021
Improving generalization in reinforcement learning with mixture regularization
K Wang, B Kang, J Shao, J Feng
Advances in Neural Information Processing Systems 33, 7968-7978, 2020
1212020
Efficient Value Iteration for s-rectangular Robust Markov Decision Processes
N Kumar, K Wang, KY Levy, S Mannor
Forty-first International Conference on Machine Learning, 0
20*
Towards Better Laplacian Representation in Reinforcement Learning with Generalized Graph Drawing
K Wang, K Zhou, Q Zhang, J Shao, B Hooi, J Feng
International Conference on Machine Learning, 11003-11012, 2021
192021
Neural epitome search for architecture-agnostic network compression
D Zhou, X Jin, Q Hou, K Wang, J Yang, J Feng
arXiv preprint arXiv:1907.05642, 2019
18*2019
Revisiting Intrinsic Reward for Exploration in Procedurally Generated Environments
K Wang, K Zhou, B Kang, J Feng, YAN Shuicheng
The Eleventh International Conference on Learning Representations, 0
11*
Relational reasoning via set transformers: Provable efficiency and applications to MARL
F Zhang, B Liu, K Wang, V Tan, Z Yang, Z Wang
Advances in Neural Information Processing Systems 35, 35825-35838, 2022
92022
The geometry of robust value functions
K Wang, N Kumar, K Zhou, B Hooi, J Feng, S Mannor
International Conference on Machine Learning, 22727-22751, 2022
62022
Tyger: Task-Type-Generic Active Learning for Molecular Property Prediction
K Zhou, K Wang, J Feng, J Tang, T Xu, X Wang
arXiv preprint arXiv:2205.11279, 2022
22022
Jointly Modelling Uncertainty and Diversity for Active Molecular Property Prediction
K Zhou, K Wang, J Tang, J Feng, B Hooi, P Zhao, T Xu, X Wang
Learning on Graphs Conference, 29: 1-29: 21, 2022
12022
Policy Gradient for Reinforcement Learning with General Utilities
N Kumar, K Wang, K Levy, S Mannor
arXiv preprint arXiv:2210.00991, 2022
12022
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel
U Gadot, K Wang, N Kumar, KY Levy, S Mannor
Forty-first International Conference on Machine Learning, 0
1*
Improving Token-Based World Models with Parallel Observation Prediction
L Cohen, K Wang, B Kang, S Mannor
arXiv preprint arXiv:2402.05643, 2024
2024
C-Procgen: Empowering Procgen with Controllable Contexts
Z Tan, K Wang, X Wang
arXiv preprint arXiv:2311.07312, 2023
2023
PPG reloaded: an empirical study on what matters in phasic policy gradient
K Wang, D Zhou, J Feng, S Mannor
2023
Reachability-Aware Laplacian Representation in Reinforcement Learning
K Wang, K Zhou, J Feng, B Hooi, X Wang
arXiv preprint arXiv:2210.13153, 2022
2022
Q-Learning for Lp Robust Markov Decision Processes
N Kumar, K Wang, K Levy, S Mannor
2022
Learning the Uncertainty Set in Robust Markov Decision Process
N Kumar, K Wang, U Gadot, KY Levy, S Mannor
The Second Tiny Papers Track at ICLR 2024, 0
Targeted Uncertainty Reduction in Robust MDPs
U Gadot, K Wang, E Derman, N Kumar, K Levy, S Mannor
NeurIPS 2023 Workshop on Generalization in Planning, 0
The system can't perform the operation now. Try again later.
Articles 1–20