Optimal gradient-based algorithms for non-concave bandit optimization. B Huang, K Huang, S Kakade, JD Lee, Q Lei, R Wang, J Yang. Advances in Neural Information Processing Systems 34, 29101-29115, 2021 | 12 | 2021 |
Going beyond linear RL: Sample efficient neural function approximation. B Huang, K Huang, S Kakade, JD Lee, Q Lei, R Wang, J Yang. Advances in Neural Information Processing Systems 34, 8968-8983, 2021 | 9 | 2021 |