Libin Zhu

Loss landscapes and optimization in over-parameterized non-linear systems and neural networks
C Liu, L Zhu, M Belkin
Applied and Computational Harmonic Analysis 59, 85-116, 2022
Cited by 287* · 2022

On the linearity of large non-linear models: when and why the tangent kernel is constant
C Liu, L Zhu, M Belkin
Advances in Neural Information Processing Systems 33, 15954-15964, 2020
Cited by 156 · 2020

Quadratic models for understanding catapult dynamics of neural networks
L Zhu, C Liu, A Radhakrishnan, M Belkin
The Twelfth International Conference on Learning Representations, 2024
Cited by 18* · 2024

Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning
L Zhu, C Liu, A Radhakrishnan, M Belkin
arXiv preprint arXiv:2306.04815, 2023
Cited by 8 · 2023

Restricted strong convexity of deep learning models with smooth activations
A Banerjee, P Cisneros-Velarde, L Zhu, M Belkin
arXiv preprint arXiv:2209.15106, 2022
Cited by 7 · 2022

Transition to linearity of general neural networks with directed acyclic graph architecture
L Zhu, C Liu, M Belkin
Advances in Neural Information Processing Systems 35, 5363-5375, 2022
Cited by 6 · 2022

Neural tangent kernel at initialization: linear width suffices
A Banerjee, P Cisneros-Velarde, L Zhu, M Belkin
Uncertainty in Artificial Intelligence, 110-118, 2023
Cited by 5 · 2023

Transition to linearity of wide neural networks is an emerging property of assembling weak models
C Liu, L Zhu, M Belkin
arXiv preprint arXiv:2203.05104, 2022
Cited by 4 · 2022

Toward Understanding the Dynamics of Over-parameterized Neural Networks
L Zhu
University of California, San Diego, 2024
2024

A note on Linear Bottleneck networks and their Transition to Multilinearity
L Zhu, P Pandit, M Belkin
arXiv preprint arXiv:2206.15058, 2022
2022