Xingtai Lv

According to our database¹, Xingtai Lv authored at least 23 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Post-Trained MoE Can Skip Half Experts via Self-Distillation.

[BibT_eX]

[DOI]

CoRR, May, 2026

How Far Can Unsupervised RLVR Scale LLM Training?

[BibT_eX]

[DOI]

CoRR, March, 2026

Process Reinforcement through Implicit Rewards.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2026

2025

FlowRL: Matching Reward Distributions for LLM Reasoning.

[BibT_eX]

[DOI]

CoRR, September, 2025

A Survey of Reinforcement Learning for Large Reasoning Models.

[BibT_eX]

[DOI]

CoRR, September, 2025

Towards a Unified View of Large Language Model Post-Training.

[BibT_eX]

[DOI]

CoRR, September, 2025

Automating Exploratory Multiomics Research via Language Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models.

[BibT_eX]

[DOI]

CoRR, March, 2025

Process Reinforcement through Implicit Rewards.

[BibT_eX]

[DOI]

CoRR, February, 2025

How to Synthesize Text Data without Model Collapse?

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Fusing Highly Specialized Language Models for Comprehensive Expertise.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Automating Exploratory Proteomics Research via Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.

[BibT_eX]

[DOI]

CoRR, 2024

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process.

[BibT_eX]

[DOI]

CoRR, 2024

Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

UltraMedical: Building Specialized Generalists in Biomedicine.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

Sparse Low-rank Adaptation of Pre-trained Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Parameter-efficient Weight Ensembling Facilitates Task-level Knowledge Transfer.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2023

Xingtai Lv

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...