Xingtai Lv

According to our database, Xingtai Lv authored at least 23 papers between 2023 and 2026.

Bibliography

2026
Post-Trained MoE Can Skip Half Experts via Self-Distillation.
CoRR, May, 2026

How Far Can Unsupervised RLVR Scale LLM Training?
CoRR, March, 2026

Process Reinforcement through Implicit Rewards.
Trans. Mach. Learn. Res., 2026

2025
FlowRL: Matching Reward Distributions for LLM Reasoning.
CoRR, September, 2025

A Survey of Reinforcement Learning for Large Reasoning Models.
CoRR, September, 2025

Towards a Unified View of Large Language Model Post-Training.
CoRR, September, 2025

Automating Exploratory Multiomics Research via Language Models.
CoRR, June, 2025

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models.
CoRR, March, 2025

Process Reinforcement through Implicit Rewards.
CoRR, February, 2025

How to Synthesize Text Data without Model Collapse?
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Fusing Highly Specialized Language Models for Comprehensive Expertise.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Automating Exploratory Proteomics Research via Language Models.
CoRR, 2024

Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
CoRR, 2024

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process.
CoRR, 2024

Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models.
CoRR, 2024

UltraMedical: Building Specialized Generalists in Biomedicine.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Sparse Low-rank Adaptation of Pre-trained Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Parameter-efficient Weight Ensembling Facilitates Task-level Knowledge Transfer.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2023

