Haoran Lian

According to our database, Haoran Lian authored at least 9 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
A Unified View of Attention and Residual Sinks: Outlier-Driven Rescaling is Essential for Transformer Training.
CoRR, January, 2026

2025
UniAttn: Reducing Inference Costs via Softmax Unification for Post-Training LLMs.
CoRR, February, 2025

LBPE: Long-token-first Tokenization to Improve Large Language Models.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Temporal Scaling Law for Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Scaffold-BPE: Enhancing Byte Pair Encoding for Large Language Models with Simple and Effective Scaffold Token Removal.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts.
CoRR, 2024

Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective Scaffold Token Removal.
CoRR, 2024

Temporal Scaling Law for Large Language Models.
CoRR, 2024

