Zhiyuan Li

Cited by

	All	Since 2019
Citations	4192	4137
h-index	23	23
i10-index	29	29

Citations per year

Year	Citations
2017	14
2018	35
2019	243
2020	640
2021	793
2022	966
2023	1064
2024	426

Public access (based on funding mandates)

Available	16 articles
Not available	0 articles

Co-authors

Sanjeev Arora - Professor of Computer Science, Princeton University - Verified email at cs.princeton.edu
Wei Hu - Assistant Professor of Computer Science and Engineering, University of Michigan - Verified email at umich.edu
Simon Shaolei Du - Assistant Professor, School of Computer Science and Engineering, University of Washington - Verified email at cs.washington.edu
Ruosong Wang - PhD Student, Carnegie Mellon University - Verified email at andrew.cmu.edu
Kaifeng Lyu - Princeton University - Verified email at princeton.edu
Ruslan Salakhutdinov - UPMC Professor, Machine Learning Department, CMU - Verified email at cs.cmu.edu
Dingli Yu - Princeton University - Verified email at cs.princeton.edu
Srinadh Bhojanapalli - Research Scientist, Google Research - Verified email at google.com
Yann LeCun - Chief AI Scientist at Facebook & Silver Professor at the Courant Institute, New York University - Verified email at cs.nyu.edu
Behnam Neyshabur - Senior Staff Research Scientist, DeepMind - Verified email at google.com
Nathan Srebro - Professor, TTIC and University of Chicago - Verified email at ttic.edu
Yi Zhang - Senior Researcher at Microsoft Research Redmond - Verified email at microsoft.com
Eva Tardos - Professor of Computer Science, Cornell University - Verified email at cornell.edu
Karthik Sridharan - Cornell University, University of Pennsylvania, Toyota Technological Institute - Verified email at ttic.edu
Dylan J. Foster - Principal Researcher, Microsoft Research - Verified email at microsoft.com
Thodoris Lykouris - MIT - Verified email at mit.edu
Yuping Luo - Computer Science Department, Princeton University - Verified email at cs.princeton.edu
Rong Ge - Duke University - Verified email at cs.duke.edu
Holden Lee - Assistant Professor of Applied Mathematics and Statistics, Johns Hopkins University - Verified email at jhu.edu
Xiang Wang - Meta, Duke University - Verified email at meta.com

Zhiyuan Li

Assistant Professor, Toyota Technological Institute at Chicago

Verified email at ttic.edu

deep learning theory, machine learning theory


Title	Cited by	Year
Fine-grained analysis of optimization and generalization for overparameterized two-layer neural networks. S Arora, S Du, W Hu, Z Li, R Wang. International Conference on Machine Learning, 322-332, 2019	952	2019
On exact computation with an infinitely wide neural net. S Arora, SS Du, W Hu, Z Li, RR Salakhutdinov, R Wang. Advances in Neural Information Processing Systems 32, 2019	896	2019
Towards understanding the role of over-parametrization in generalization of neural networks. B Neyshabur, Z Li, S Bhojanapalli, Y LeCun, N Srebro. arXiv preprint arXiv:1805.12076, 2018	567	2018
An exponential learning rate schedule for deep learning. Z Li, S Arora. arXiv preprint arXiv:1910.07454, 2019	185	2019
Harnessing the power of infinitely wide deep nets on small-data tasks. S Arora, SS Du, Z Li, R Salakhutdinov, R Wang, D Yu	179	2019
Enhanced convolutional neural tangent kernels. Z Li, R Wang, D Yu, SS Du, W Hu, R Salakhutdinov, S Arora. arXiv preprint arXiv:1911.00809, 2019	122	2019
Theoretical analysis of auto rate-tuning by batch normalization. S Arora, Z Li, K Lyu. arXiv preprint arXiv:1812.03981, 2018	118	2018
Learning in games: Robustness of fast convergence. DJ Foster, Z Li, T Lykouris, K Sridharan, E Tardos. Advances in Neural Information Processing Systems 29, 2016	114	2016
Towards resolving the implicit bias of gradient descent for matrix factorization: Greedy low-rank learning. Z Li, Y Luo, K Lyu. arXiv preprint arXiv:2012.09839, 2020	110	2020
Explaining landscape connectivity of low-cost solutions for multilayer nets. R Kuditipudi, X Wang, H Lee, Y Zhang, Z Li, W Hu, R Ge, S Arora. Advances in Neural Information Processing Systems 32, 2019	86	2019
Simple and effective regularization methods for training on noisily labeled data with generalization guarantee. W Hu, Z Li, D Yu. International Conference on Learning Representations (ICLR 2020), 2019	83*	2019
Understanding gradient descent on the edge of stability in deep learning. S Arora, Z Li, A Panigrahi. International Conference on Machine Learning, 948-1024, 2022	75	2022
What Happens after SGD Reaches Zero Loss? A Mathematical Framework. Z Li, T Wang, S Arora. arXiv preprint arXiv:2110.06914, 2021	75	2021
Gradient descent on two-layer nets: Margin maximization and simplicity bias. K Lyu, Z Li, R Wang, S Arora. Advances in Neural Information Processing Systems 34, 12978-12991, 2021	62	2021
On the validity of modeling SGD with stochastic differential equations (SDEs). Z Li, S Malladi, S Arora. Advances in Neural Information Processing Systems 34, 12712-12725, 2021	60	2021
Reconciling modern deep learning with traditional optimization analyses: The intrinsic learning rate. Z Li, K Lyu, S Arora. Advances in Neural Information Processing Systems 33, 14544-14555, 2020	59	2020
Understanding the generalization benefit of normalization layers: Sharpness reduction. K Lyu, Z Li, S Arora. Advances in Neural Information Processing Systems 35, 34689-34708, 2022	53	2022
Sophia: A scalable stochastic second-order optimizer for language model pre-training. H Liu, Z Li, D Hall, P Liang, T Ma. arXiv preprint arXiv:2305.14342, 2023	50	2023
How Does Sharpness-Aware Minimization Minimize Sharpness? K Wen, T Ma, Z Li. The Eleventh International Conference on Learning Representations, 2023	49	2023
Risk bounds and Rademacher complexity in batch reinforcement learning. Y Duan, C Jin, Z Li. International Conference on Machine Learning, 2892-2902, 2021	49	2021
