Follow
Behnam Neyshabur
Behnam Neyshabur
Member of Technical Staff, Anthropic
Verified email at anthropic.com - Homepage
Title
Cited by
Cited by
Year
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
22132023
Exploring generalization in deep learning
B Neyshabur, S Bhojanapalli, D McAllester, N Srebro
Advances in Neural Information Processing Systems, 2017
14712017
Sharpness-Aware Minimization for Efficiently Improving Generalization
P Foret, A Kleiner, H Mobahi, B Neyshabur
International Conference on Learning Representations, 2021
14232021
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
Transactions on Machine Learning Research, 2023
11772023
In Search of the Real Inductive Bias: On the Role of Implicit Regularization in Deep Learning
B Neyshabur, R Tomioka, N Srebro
International Conference on Learning Representations, 2015
7442015
Stronger generalization bounds for deep nets via a compression approach
S Arora, R Ge, B Neyshabur, Y Zhang
The 35th International Conference on Machine Learning, 2018
7092018
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
7022024
A pac-bayesian approach to spectrally-normalized margin bounds for neural networks
B Neyshabur, S Bhojanapalli, N Srebro
International Conference on Learning Representations, 2018
6882018
Fantastic Generalization Measures and Where to Find Them
Y Jiang, B Neyshabur, H Mobahi, D Krishnan, S Bengio
International Conference on Learning Representations, 2020
6702020
Norm-Based Capacity Control in Neural Networks
B Neyshabur, R Tomioka, N Srebro
Conference on Learning Theory, 1376–1401, 2015
6632015
Towards understanding the role of over-parametrization in generalization of neural networks
B Neyshabur, Z Li, S Bhojanapalli, Y LeCun, N Srebro
International Conference on Learning Representations, 2019
6232019
Solving quantitative reasoning problems with language models
A Lewkowycz, A Andreassen, D Dohan, E Dyer, H Michalewski, ...
Advances in Neural Information Processing Systems, 2022
6052022
Implicit regularization in matrix factorization
S Gunasekar, BE Woodworth, S Bhojanapalli, B Neyshabur, N Srebro
Advances in neural information processing systems 30, 2017
5582017
What is being transferred in transfer learning?
B Neyshabur, H Sedghi, C Zhang
Advances in Neural Information Processing Systems, 2020
5432020
Global Optimality of Local Search for Low Rank Matrix Recovery
S Bhojanapalli, B Neyshabur, N Srebro
Advances in Neural Information Processing Systems, 2016
4592016
Predicting protein–protein interactions through sequence-based deep learning
S Hashemifar, B Neyshabur, AA Khan, J Xu
Bioinformatics 34 (17), i802-i810, 2018
3562018
Path-SGD: Path-Normalized Optimization in Deep Neural Networks
B Neyshabur, RR Salakhutdinov, N Srebro
Advances in Neural Information Processing Systems, 2413-2421, 2015
3412015
On Symmetric and Asymmetric LSHs for Inner Product Search
B Neyshabur, N Srebro
The 32nd International Conference on Machine Learning, 1926–1934, 2015
2232015
NETAL: a new graph-based method for global alignment of protein–protein interaction networks
B Neyshabur, A Khadem, S Hashemifar, SS Arab
Bioinformatics 29 (13), 1654-1662, 2013
2122013
Gemma 2: Improving open language models at a practical size
G Team, M Riviere, S Pathak, PG Sessa, C Hardin, S Bhupatiraju, ...
arXiv preprint arXiv:2408.00118, 2024
2032024
The system can't perform the operation now. Try again later.
Articles 1–20