Tengyu Xu

Cited by

	All	Since 2019
Citations	949	932
h-index	15	15
i10-index	16	16

220

110

165

201720182019202020212022202320246 10 31 87 183 215 218 195

Public access

View all

14 articles

1 article

available

not available

Based on funding mandates

Co-authors

Yingbin LiangThe Ohio State UniversityVerified email at osu.edu
Guanghui (George) LanProfessor, Georgia Institute of TechnologyVerified email at isye.gatech.edu
HV PoorMichael Henry Strater University Professor, Princeton UniversityVerified email at princeton.edu
Zhaoran WangAssistant Professor at Northwestern UniversityVerified email at northwestern.edu

Tengyu Xu

Meta Platforms, Inc.

Verified email at meta.com - Homepage

Reinforcement Learning Deep Learning Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Finite-sample analysis for sarsa with linear function approximation S Zou, T Xu, Y Liang Advances in neural information processing systems 32, 2019	193	2019
Crpo: A new approach for safe reinforcement learning with convergence guarantee T Xu, Y Liang, G Lan International Conference on Machine Learning, 11480-11491, 2021	146*	2021
Improving sample complexity bounds for (natural) actor-critic algorithms T Xu, Z Wang, Y Liang Advances in Neural Information Processing Systems 33, 4358-4369, 2020	137*	2020
Two time-scale off-policy TD learning: Non-asymptotic analysis over Markovian samples T Xu, S Zou, Y Liang Advances in neural information processing systems 32, 2019	84	2019
Reanalysis of variance reduced temporal difference learning T Xu, Z Wang, Y Zhou, Y Liang arXiv preprint arXiv:2001.01898, 2020	46	2020
Enhanced first and zeroth order variance reduced algorithms for min-max optimization T Xu, Z Wang, Y Liang, HV Poor	45*	2020
Algorithms for the estimation of transient surface heat flux during ultra-fast surface cooling ZF Zhou, TY Xu, B Chen International Journal of Heat and Mass Transfer 100, 1-10, 2016	43	2016
Non-asymptotic convergence of adam-type reinforcement learning algorithms under markovian sampling H Xiong, T Xu, Y Liang, W Zhang Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10460 …, 2021	34	2021
Proximal gradient descent-ascent: Variable convergence under k {\L} geometry Z Chen, Y Zhou, T Xu, Y Liang arXiv preprint arXiv:2102.04653, 2021	31	2021
Sample complexity bounds for two timescale value-based reinforcement learning algorithms T Xu, Y Liang International conference on artificial intelligence and statistics, 811-819, 2021	30	2021
Faster algorithm and sharper analysis for constrained Markov decision process T Li, Z Guan, S Zou, T Xu, Y Liang, G Lan Operations Research Letters 54, 107107, 2024	28	2024
Doubly robust off-policy actor-critic: Convergence and optimality T Xu, Z Yang, Z Wang, Y Liang International Conference on Machine Learning, 11581-11591, 2021	27	2021
When will generative adversarial imitation learning algorithms attain global convergence Z Guan, T Xu, Y Liang International Conference on Artificial Intelligence and Statistics, 1117-1125, 2021	24	2021
When Will Gradient Methods Converge to Max-margin Classifier under ReLU Models? T Xu, Y Zhou, K Ji, Y Liang arXiv preprint arXiv:1806.04339, 2018	23*	2018
Model-based offline meta-reinforcement learning with regularization S Lin, J Wan, T Xu, Y Liang, J Zhang arXiv preprint arXiv:2202.02929, 2022	20	2022
Provably efficient offline reinforcement learning with trajectory-wise reward T Xu, Y Wang, S Zou, Y Liang IEEE Transactions on Information Theory, 2024	14	2024
Deterministic policy gradient: Convergence analysis H Xiong, T Xu, L Zhao, Y Liang, W Zhang Uncertainty in Artificial Intelligence, 2159-2169, 2022	9	2022
PER-ETD: A polynomially efficient emphatic temporal difference learning method Z Guan, T Xu, Y Liang arXiv preprint arXiv:2110.06906, 2021	9	2021
A unifying framework of off-policy general value function evaluation T Xu, Z Yang, Z Wang, Y Liang Advances in Neural Information Processing Systems 35, 13570-13583, 2022	4*	2022
Constraint‐based multi‐agent reinforcement learning for collaborative tasks X Shang, T Xu, I Karamouzas, M Kallmann Computer Animation and Virtual Worlds 34 (3-4), e2182, 2023	2	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors