Chao Ma

ORCID: 0000-0002-8901-960X

Affiliations:
  • Stanford University, Department of Mathematics, Stanford, CA, USA
  • Princeton University, Program in Applied and Computational Mathematics, Princeton, NJ, USA (PhD 2020)


According to our database, Chao Ma authored at least 30 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Understanding the Generalization Benefits of Late Learning Rate Decay.
CoRR, 2024

2022
Why Self-attention is Natural for Sequence-to-Sequence Problems? A Perspective from Symmetries.
CoRR, 2022

The Multiscale Structure of Neural Network Loss Functions: The Effect on Optimization and Origin.
CoRR, 2022

Provably convergent quasistatic dynamics for mean-field two-player zero-sum games.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
A Riemannian Mean Field Formulation for Two-layer Neural Networks with Batch Normalization.
CoRR, 2021

The Sobolev Regularization Effect of Stochastic Gradient Descent.
CoRR, 2021

On Linear Stability of SGD and Input-Smoothness of Neural Networks.
Advances in Neural Information Processing Systems 34, 2021

A Qualitative Study of the Dynamic Behavior for Adaptive Gradient Algorithms.
Proceedings of Mathematical and Scientific Machine Learning, 2021

2020
Heterogeneous Multireference Alignment for Images With Application to 2D Classification in Single Particle Reconstruction.
IEEE Trans. Image Process., 2020

Achieving Adversarial Robustness Requires An Active Teacher.
CoRR, 2020

Towards a Mathematical Understanding of Neural Network-Based Machine Learning: what we know and what we don't.
CoRR, 2020

Complexity Measures for Neural Networks with General Activation Functions Using Path-based Norms.
CoRR, 2020

The Quenching-Activation Behavior of the Gradient Descent Dynamics for Two-layer Neural Network Models.
CoRR, 2020

A Mean-field Analysis of Deep ResNet and Beyond: Towards Provable Optimization Via Overparameterization From Depth.
CoRR, 2020

Towards Theoretically Understanding Why SGD Generalizes Better Than Adam in Deep Learning.
Advances in Neural Information Processing Systems 33, 2020

The Slow Deterioration of the Generalization Error of the Random Feature Model.
Proceedings of Mathematical and Scientific Machine Learning, 2020

A Mean Field Analysis of Deep ResNet and Beyond: Towards Provable Optimization via Overparameterization from Depth.
Proceedings of the 37th International Conference on Machine Learning, 2020

A Priori Estimates of the Generalization Error for Autoencoders.
Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020

2019
Globally Convergent Levenberg-Marquardt Method for Phase Retrieval.
IEEE Trans. Inf. Theory, 2019

Machine Learning from a Continuous Viewpoint.
CoRR, 2019

On the Generalization Properties of Minimum-norm Solutions for Over-parameterized Neural Network Models.
CoRR, 2019

Barron Spaces and the Compositional Function Spaces for Neural Network Models.
CoRR, 2019

Analysis of the Gradient Descent Algorithm for a Deep Neural Network Model with Skip-connections.
CoRR, 2019

A Comparative Analysis of the Optimization and Generalization Property of Two-layer Neural Network and Random Feature Models Under Gradient Descent Dynamics.
CoRR, 2019

A Priori Estimates of the Population Risk for Residual Networks.
CoRR, 2019

Global Convergence of Gradient Descent for Deep Linear Residual Networks.
Advances in Neural Information Processing Systems 32, 2019

2018
Bispectrum Inversion With Application to Multireference Alignment.
IEEE Trans. Signal Process., 2018

A Priori Estimates of the Generalization Error for Two-layer Neural Networks.
CoRR, 2018

Model Reduction with Memory and the Machine Learning of Dynamical Systems.
CoRR, 2018

How SGD Selects the Global Minima in Over-parameterized Learning: A Dynamical Stability Perspective.
Advances in Neural Information Processing Systems 31, 2018

