Yiyuan Ma

ORCID: 0009-0000-0693-577X

According to our database, Yiyuan Ma authored at least 14 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws.
CoRR, May, 2026

SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning.
CoRR, February, 2026

MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production.
Proceedings of the 21st European Conference on Computer Systems, 2026

2025
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss.
CoRR, December, 2025

GatePro: Parameter-Free Expert Selection Optimization for Mixture-of-Experts Models.
CoRR, October, 2025

Balanced Actor Initialization: Stable RLHF Training of Distillation-Based Reasoning Models.
CoRR, September, 2025

Model Merging in Pre-training of Large Language Models.
CoRR, May, 2025

MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production.
CoRR, May, 2025

Model Merging in Pre-training of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Design and Optimization of an Origami-Inspired Foldable Pneumatic Actuator.
IEEE Robotics Autom. Lett., February, 2024

2022
One-Dimensional Photonic Crystal Filter with Multiple Defect Layers Based on Particle Swarm Optimization.
Proceedings of the Methods and Applications for Modeling and Simulation of Complex Systems, 2022

2020
Acu-Net: A 3D Attention Context U-Net for Multiple Sclerosis Lesion Segmentation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2018
Hypertension Warning Model Based on Random Forest and Distance Metrics.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

