Haoran Lian

According to our database¹, Haoran Lian authored at least 9 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2026

A Unified View of Attention and Residual Sinks: Outlier-Driven Rescaling is Essential for Transformer Training.

CoRR, January, 2026

2025

UniAttn: Reducing Inference Costs via Softmax Unification for Post-Training LLMs.

CoRR, February, 2025

LBPE: Long-token-first Tokenization to Improve Large Language Models.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

Temporal Scaling Law for Large Language Models.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models.

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Scaffold-BPE: Enhancing Byte Pair Encoding for Large Language Models with Simple and Effective Scaffold Token Removal.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts.

CoRR, 2024

Scaffold-BPE: Enhancing Byte Pair Encoding with Simple and Effective Scaffold Token Removal.

CoRR, 2024

Temporal Scaling Law for Large Language Models.

CoRR, 2024
