Mou Sun

According to our database, Mou Sun authored at least 10 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
Grouter: Decoupling Routing from Representation for Accelerated MoE Training.
CoRR, March, 2026

Practical FP4 Training for Large-Scale MoE Models on Hopper GPUs.
CoRR, March, 2026

Accelerating LLM Pre-Training through Flat-Direction Dynamics Enhancement.
CoRR, February, 2026

Synergistic Intra- and Cross-Layer Regularization Losses for MoE Expert Specialization.
CoRR, February, 2026

Expected Maximization of a Concave Utility Function Under Threshold-Based Activation.
Axioms, 2026

AutoHAAP: Automated Heterogeneity-Aware Asymmetric Partitioning for LLM Training.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026

2025
FP8-Flow-MoE: A Casting-Free FP8 Recipe without Double Quantization Error.
CoRR, November, 2025

MeCeFO: Enhancing LLM Training Robustness via Fault-Tolerant Optimization.
CoRR, October, 2025

2024
Decomposition Methods for Global Solution of Mixed-Integer Linear Programs.
SIAM J. Optim., 2024

2023
MindOpt Adapter for CPLEX Benchmarking Performance Analysis.
CoRR, 2023

