Hao-Jun Michael Shi

According to our database1, Hao-Jun Michael Shi authored at least 15 papers between 2016 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Clarifying Shampoo: Adapting Spectral Descent to Stochasticity and the Parameter Trajectory.
CoRR, February, 2026

Adaptive Batch Sizes Using Non-Euclidean Gradient Noise Scales for Stochastic Sign and Spectral Descent.
CoRR, February, 2026

2025
Smoothing DiLoCo with Primal Averaging for Faster Training of LLMs.
CoRR, December, 2025

SNOO: Step-K Nesterov Outer Optimizer - The Surprising Effectiveness of Nesterov Momentum Applied to Pseudo-Gradients.
CoRR, October, 2025

Purifying Shampoo: Investigating Shampoo's Heuristics by Decomposing its Preconditioner.
CoRR, June, 2025

2023
On the numerical performance of finite-difference-based methods for derivative-free optimization.
Optim. Methods Softw., March, 2023

A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale.
CoRR, 2023

2022
Adaptive Finite-Difference Interval Estimation for Noisy Derivative-Free Optimization.
SIAM J. Sci. Comput., August, 2022

A Noise-Tolerant Quasi-Newton Algorithm for Unconstrained Optimization.
SIAM J. Optim., 2022

2020
Compositional Embeddings Using Complementary Partitions for Memory-Efficient Recommendation Systems.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

2019
Deep Learning Recommendation Model for Personalization and Recommendation Systems.
CoRR, 2019

2018
Optimizing Quantization for Lasso Recovery.
IEEE Signal Process. Lett., 2018

A Progressive Batching L-BFGS Method for Machine Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

2016
Practical Algorithms for Learning Near-Isometric Linear Embeddings.
CoRR, 2016

Methods for quantized compressed sensing.
Proceedings of the 2016 Information Theory and Applications Workshop, 2016


  Loading...