Yucheng Lu

Affiliations:
  • New York University Shanghai, China


According to our database1, Yucheng Lu authored at least 16 papers between 2020 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Decentralized Learning: Theoretical Optimality and Practical Improvements.
J. Mach. Learn. Res., 2023

Scale up with Order: Finding Good Data Permutations for Distributed Training.
CoRR, 2023

CD-GraB: Coordinating Distributed Example Orders for Provably Accelerated Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

CocktailSGD: Fine-tuning Foundation Models over 500Mbps Networks.
Proceedings of the International Conference on Machine Learning, 2023

STEP: Learning N: M Structured Sparsity Masks from Scratch with Precondition.
Proceedings of the International Conference on Machine Learning, 2023

Maximizing Communication Efficiency for Large-scale Training via 0/1 Adam.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
GraB: Finding Provably Better Data Permutations than Random Reshuffling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A General Analysis of Example-Selection for Stochastic Gradient Descent.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Variance Reduction in Training Forecasting Models with Subgroup Sampling.
CoRR, 2021

Hyperparameter Optimization Is Deceiving Us, and How to Stop It.
CoRR, 2021

Hyperparameter Optimization Is Deceiving Us, and How to Stop It.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Optimal Complexity in Decentralized Training.
Proceedings of the 38th International Conference on Machine Learning, 2021

Variance Reduced Training with Stratified Sampling for Forecasting Models.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Towards Optimal Convergence Rate in Decentralized Stochastic Training.
CoRR, 2020

MixML: A Unified Analysis of Weakly Consistent Parallel Learning.
CoRR, 2020

Moniqua: Modulo Quantized Communication in Decentralized SGD.
Proceedings of the 37th International Conference on Machine Learning, 2020


  Loading...