Lechao Xiao
Google Brain
Verified email at google.com
Title, Cited by, Year
Wide neural networks of any depth evolve as linear models under gradient descent
J Lee, L Xiao, S Schoenholz, Y Bahri, R Novak, J Sohl-Dickstein, ...
Advances in neural information processing systems 32, 2019
Cited by 1023 (2019)
Dynamical isometry and a mean field theory of cnns: How to train 10,000-layer vanilla convolutional neural networks
L Xiao, Y Bahri, J Sohl-Dickstein, S Schoenholz, J Pennington
International Conference on Machine Learning, 5393-5402, 2018
Cited by 356 (2018)
Bayesian Deep Convolutional Neural Networks with Many Channels are Gaussian Processes
R Novak, L Xiao, Y Bahri, J Lee, G Yang, DA Abolafia, J Pennington, ...
ICLR 2019, 2018
Cited by 346* (2018)
Neural tangents: Fast and easy infinite neural networks in python
R Novak, L Xiao, J Hron, J Lee, AA Alemi, J Sohl-Dickstein, ...
arXiv preprint arXiv:1912.02803, 2019
Cited by 240 (2019)
Dataset distillation with infinitely wide convolutional networks
T Nguyen, R Novak, L Xiao, J Lee
Advances in Neural Information Processing Systems 34, 5186-5198, 2021
Cited by 193 (2021)
Finite versus infinite neural networks: an empirical study
J Lee, S Schoenholz, J Pennington, B Adlam, L Xiao, R Novak, ...
Advances in Neural Information Processing Systems 33, 15156-15172, 2020
Cited by 187 (2020)
Provable benefit of orthogonal initialization in optimizing deep linear networks
W Hu, L Xiao, J Pennington
arXiv preprint arXiv:2001.05992, 2020
Cited by 128 (2020)
Disentangling trainability and generalization in deep neural networks
L Xiao, J Pennington, S Schoenholz
International Conference on Machine Learning, 10462-10472, 2020
Cited by 103* (2020)
The surprising simplicity of the early-time learning dynamics of neural networks
W Hu, L Xiao, B Adlam, J Pennington
Advances in Neural Information Processing Systems 33, 17116-17128, 2020
Cited by 67 (2020)
Uniform estimates for bilinear Hilbert transforms and bilinear maximal functions associated to polynomials
X Li, L Xiao
American Journal of Mathematics 138 (4), 907-962, 2016
Cited by 38 (2016)
Beyond human data: Scaling self-training for problem-solving with language models
A Singh, JD Co-Reyes, R Agarwal, A Anand, P Patil, PJ Liu, J Harrison, ...
arXiv preprint arXiv:2312.06585, 2023
Cited by 30 (2023)
Maximal decay inequalities for trilinear oscillatory integrals of convolution type
PT Gressman, L Xiao
Journal of Functional Analysis 271 (12), 3695-3726, 2016
Cited by 22 (2016)
Precise learning curves and higher-order scalings for dot-product kernel regression
L Xiao, H Hu, T Misiakiewicz, Y Lu, J Pennington
Advances in Neural Information Processing Systems 35, 4558-4570, 2022
Cited by 19 (2022)
Endpoint estimates for one-dimensional oscillatory integral operators
L Xiao
Advances in Mathematics 316, 255-291, 2017
Cited by 19 (2017)
Eigenspace restructuring: a principle of space and frequency in neural networks
L Xiao
Conference on Learning Theory, 4888-4944, 2022
Cited by 17 (2022)
Exploring the Uncertainty Properties of Neural Networks' Implicit Priors in the Infinite-Width Limit
B Adlam, J Lee, L Xiao, J Pennington, J Snoek
ICLR, 2020
Cited by 17 (2020)
Small-scale proxies for large-scale transformer training instabilities
M Wortsman, PJ Liu, L Xiao, K Everett, A Alemi, B Adlam, JD Co-Reyes, ...
arXiv preprint arXiv:2309.14322, 2023
Cited by 16 (2023)
Bilinear Hilbert transforms associated with plane curves
J Guo, L Xiao
The Journal of Geometric Analysis 26, 967-995, 2016
Cited by 16 (2016)
Precise learning curves and higher-order scaling limits for dot product kernel regression
L Xiao, J Pennington
arXiv preprint arXiv:2205.14846, 2022
Cited by 14 (2022)
Fast neural kernel embeddings for general activations
I Han, A Zandieh, J Lee, R Novak, L Xiao, A Karbasi
Advances in neural information processing systems 35, 35657-35671, 2022
Cited by 13 (2022)