Minhak Song

ORCID: 0009-0005-4940-8837

According to our database, Minhak Song authored at least 10 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
AMUSE: Anytime Muon with Stable Gradient Evaluation.
CoRR, May, 2026

Zeroth-Order Optimization at the Edge of Stability.
CoRR, April, 2026

Suspicious Alignment of SGD: A Fine-Grained Step Size Condition Analysis.
CoRR, January, 2026

2025
Implicit Bias of Per-sample Adam on Separable Data: Departure from the Full-batch Regime.
CoRR, October, 2025

Understanding the Performance Gap in Preference Learning: A Dichotomy of RLHF and DPO.
CoRR, May, 2025

Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Understanding Sharpness Dynamics in NN Training with a Minimalist Example: The Effects of Dataset Difficulty, Depth, Stochasticity, and More.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Does SGD really happen in tiny subspaces?
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Linear attention is (maybe) all you need (to understand Transformer optimization).
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Trajectory Alignment: Understanding the Edge of Stability Phenomenon via Bifurcation Theory.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

