Dawei Li

Orcid: 0000-0003-0374-3101

Affiliations:
  • University of Illinois at Urbana-Champaign, Department of Industrial and Enterprise Systems Engineering, Coordinated Science Laboratory, Urbana, IL, USA


According to our database1, Dawei Li authored at least 10 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
EMA-Nesterov: Stabilizing Nesterov's Lookahead for Accelerated Deep Learning Optimization.
CoRR, May, 2026

Revisiting the Adam-SGD Gap in LLM Pre-Training: The Role of Large Effective Learning Rates.
CoRR, May, 2026

2023
NTK-SAP: Improving neural network pruning by aligning training dynamics.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Suboptimal Local Minima Exist for Wide Neural Networks with Smooth Activations.
Math. Oper. Res., November, 2022

On the Benefit of Width for Neural Networks: Disappearance of Basins.
SIAM J. Optim., September, 2022

2021
On a Faster R-Linear Convergence Rate of the Barzilai-Borwein Method.
CoRR, 2021

RMSprop converges with proper hyper-parameter.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
The Global Landscape of Neural Networks: An Overview.
IEEE Signal Process. Mag., 2020

2019
Sub-Optimal Local Minima Exist for Almost All Over-parameterized Neural Networks.
CoRR, 2019

2018
Over-Parameterized Deep Neural Networks Have No Strict Local Minima For Any Continuous Activations.
CoRR, 2018


  Loading...