Understanding deep learning (still) requires rethinking generalization C Zhang, S Bengio, M Hardt, B Recht, O Vinyals Communications of the ACM 64 (3), 107-115, 2021 | 5768 | 2021 |
Mxnet: A flexible and efficient machine learning library for heterogeneous distributed systems T Chen, M Li, Y Li, M Lin, N Wang, M Wang, T Xiao, B Xu, C Zhang, ... arXiv preprint arXiv:1512.01274, 2015 | 2593 | 2015 |
Unsupervised feature selection for multi-cluster data D Cai, C Zhang, X He Proceedings of the 16th ACM SIGKDD international conference on Knowledge …, 2010 | 1220 | 2010 |
Transfusion: Understanding transfer learning for medical imaging M Raghu, C Zhang, J Kleinberg, S Bengio Advances in Neural Information Processing Systems, 2019 | 1029 | 2019 |
Training deep nets with sublinear memory cost T Chen, B Xu, C Zhang, C Guestrin arXiv preprint arXiv:1604.06174, 2016 | 685 | 2016 |
Learning with a Wasserstein loss C Frogner, C Zhang, H Mobahi, M Araya, TA Poggio Advances in neural information processing systems 28, 2015 | 578 | 2015 |
Do vision transformers see like convolutional neural networks? M Raghu, T Unterthiner, S Kornblith, C Zhang, A Dosovitskiy Advances in Neural Information Processing Systems 34, 12116-12128, 2021 | 513 | 2021 |
Machine theory of mind N Rabinowitz, F Perbet, F Song, C Zhang, SMA Eslami, M Botvinick International conference on machine learning, 4218-4227, 2018 | 471 | 2018 |
A study on overfitting in deep reinforcement learning C Zhang, O Vinyals, R Munos, S Bengio arXiv preprint arXiv:1804.06893, 2018 | 390 | 2018 |
What is being transferred in transfer learning? B Neyshabur, H Sedghi, C Zhang Advances in Neural Information Processing Systems, 2020 | 295 | 2020 |
Automated fault detection without seismic processing M Araya-Polo, T Dahlke, C Frogner, C Zhang, T Poggio, D Hohl The Leading Edge 36 (3), 208-214, 2017 | 270 | 2017 |
What neural networks memorize and why: Discovering the long tail via influence estimation V Feldman, C Zhang Advances in Neural Information Processing Systems, Spotlight, 2020 | 198 | 2020 |
Deduplicating training data makes language models better K Lee, D Ippolito, A Nystrom, C Zhang, D Eck, C Callison-Burch, N Carlini arXiv preprint arXiv:2107.06499, 2021 | 171 | 2021 |
Quantifying memorization across neural language models N Carlini, D Ippolito, M Jagielski, K Lee, F Tramer, C Zhang arXiv preprint arXiv:2202.07646, 2022 | 163 | 2022 |
Are all layers created equal? C Zhang, S Bengio, Y Singer The Journal of Machine Learning Research 23 (1), 2930-2957, 2022 | 141 | 2022 |
Theory of deep learning IIb: Optimization properties of SGD C Zhang, Q Liao, A Rakhlin, B Miranda, N Golowich, T Poggio arXiv preprint arXiv:1801.02254, 2018 | 112* | 2018 |
A variance minimization criterion to feature selection using laplacian regularization X He, M Ji, C Zhang, H Bao IEEE transactions on pattern analysis and machine intelligence 33 (10), 2013 …, 2011 | 104 | 2011 |
International Conference on Learning Representations H Zhang, M Cisse, YN Dauphin, D Lopez-Paz ICML, 2018 | 99 | 2018 |
Machine-learning based automated fault detection in seismic traces C Zhang, C Frogner, M Araya-Polo, D Hohl 76th EAGE Conference and Exhibition 2014 2014 (1), 1-5, 2014 | 98 | 2014 |
Unrestricted adversarial examples TB Brown, N Carlini, C Zhang, C Olsson, P Christiano, I Goodfellow arXiv preprint arXiv:1809.08352, 2018 | 83 | 2018 |