Lifan Yuan

According to our database¹, Lifan Yuan authored at least 33 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments.

[BibT_eX]

[DOI]

CoRR, November, 2025

Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark.

[BibT_eX]

[DOI]

CoRR, September, 2025

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones.

[BibT_eX]

[DOI]

CoRR, September, 2025

RLPR: Extrapolating RLVR to General Domains without Verifiers.

[BibT_eX]

[DOI]

CoRR, June, 2025

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models.

[BibT_eX]

[DOI]

CoRR, May, 2025

The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning.

[BibT_eX]

[DOI]

CoRR, May, 2025

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models.

[BibT_eX]

[DOI]

CoRR, May, 2025

Process Reinforcement through Implicit Rewards.

[BibT_eX]

[DOI]

CoRR, February, 2025

Free Process Rewards without Process Labels.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Advancing LLM Reasoning Generalists with Preference Trees.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Free Process Rewards without Process Labels.

[BibT_eX]

[DOI]

CoRR, 2024

Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity.

[BibT_eX]

[DOI]

CoRR, 2024

Advancing LLM Reasoning Generalists with Preference Trees.

[BibT_eX]

[DOI]

CoRR, 2024

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment.

[BibT_eX]

[DOI]

CoRR, 2024

Noise Contrastive Alignment of Language Models with Explicit Rewards.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Executable Code Actions Elicit Better LLM Agents.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

Deep Clustering and Visualization for End-to-End High-Dimensional Data Analysis.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., November, 2023

Removing Backdoors in Pre-trained Models by Regularized Continual Pre-training.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2023

Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown.

[BibT_eX]

[DOI]

CoRR, 2023

UltraFeedback: Boosting Language Models with High-quality Feedback.

[BibT_eX]

[DOI]

CoRR, 2023

Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations.

[BibT_eX]

[DOI]

CoRR, 2023

Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial Attack Framework.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

A Close Look into the Calibration of Pre-trained Language Models.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Lifan Yuan

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...