Follow
Shaofeng Zou
Shaofeng Zou
Assistant Professor, University at Buffalo the State University of New York
Verified email at buffalo.edu - Homepage
Title
Cited by
Cited by
Year
Finite-sample analysis for sarsa with linear function approximation
S Zou, T Xu, Y Liang
NeurIPS 2019, 2019
1882019
Tightening mutual information-based bounds on generalization error
Y Bu, S Zou, VV Veeravalli
IEEE Journal on Selected Areas in Information Theory 1 (1), 121-130, 2020
1872020
Sequential (quickest) change detection: Classical results and new directions
L Xie, S Zou, Y Xie, VV Veeravalli
IEEE Journal on Selected Areas in Information Theory 2 (2), 494-514, 2021
1152021
Two time-scale off-policy TD learning: Non-asymptotic analysis over Markovian samples
T Xu, S Zou, Y Liang
Advances in Neural Information Processing Systems, 10633-10643, 2019
842019
Online robust reinforcement learning with model uncertainty
Y Wang, S Zou
Advances in Neural Information Processing Systems 34, 7193-7206, 2021
832021
Estimation of KL divergence: Optimal minimax rate
Y Bu, S Zou, Y Liang, VV Veeravalli
IEEE Transactions on Information Theory 64 (4), 2648-2674, 2018
792018
Nonparametric Detection of Anomalous Data Streams
S Zou, Y Liang, HV Poor, X Shi
IEEE Transactions on Signal Processing 65 (21), 5785 - 5797, 2017
60*2017
Policy gradient method for robust reinforcement learning
Y Wang, S Zou
International conference on machine learning, 23484-23526, 2022
542022
An information theoretic approach to secret sharing
S Zou, Y Liang, L Lai, S Shamai
IEEE Transactions on Information Theory 61 (6), 3121-3136, 2015
432015
Quickest change detection under transient dynamics: Theory and asymptotic analysis
S Zou, G Fellouris, VV Veeravalli
IEEE Transactions on Information Theory 65 (3), 1397--1412, 2019
42*2019
Quickest detection of dynamic events in networks
S Zou, VV Veeravalli, J Li, D Towsley
IEEE Transactions on Information Theory, 2019
382019
Robust multi-agent reinforcement learning with state uncertainty
S He, S Han, S Su, S Han, S Zou, F Miao
arXiv preprint arXiv:2307.16212, 2023
312023
Faster algorithm and sharper analysis for constrained markov decision process
T Li, Z Guan, S Zou, T Xu, Y Liang, G Lan
Operations Research Letters 54, 107107, 2024
282024
Information-Theoretic Understanding of Population Risk Improvement with Model Compression.
Y Bu, W Gao, S Zou, VV Veeravalli
AAAI, 3300-3307, 2020
28*2020
Sequential algorithms for moving anomaly detection in networks
G Rovatsos, S Zou, VV Veeravalli
Sequential Analysis, 2020
27*2020
Signal processing and machine learning for biomedical big data
E Sejdic, TH Falk
CRC press, 2018
272018
A robust and constrained multi-agent reinforcement learning framework for electric vehicle amod systems
S He, Y Wang, S Han, S Zou, F Miao
arXiv preprint arXiv:2209.08230, 2022
262022
Finite-sample analysis of Greedy-GQ with linear function approximation under Markovian noise
Y Wang, S Zou
Conference on Uncertainty in Artificial Intelligence, 11-20, 2020
252020
Estimation of KL divergence between large-alphabet distributions
Y Bu, S Zou, Y Liang, VV Veeravalli
2016 IEEE International Symposium on Information Theory (ISIT), 1118-1122, 2016
252016
Sample and communication-efficient decentralized actor-critic algorithms with finite-time analysis
Z Chen, Y Zhou, RR Chen, S Zou
International Conference on Machine Learning, 3794-3834, 2022
242022
The system can't perform the operation now. Try again later.
Articles 1–20