Lifan Yuan

According to our database, Lifan Yuan authored at least 29 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
RLPR: Extrapolating RLVR to General Domains without Verifiers.
CoRR, June, 2025

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models.
CoRR, May, 2025

The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning.
CoRR, May, 2025

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models.
CoRR, May, 2025

Process Reinforcement through Implicit Rewards.
CoRR, February, 2025

Advancing LLM Reasoning Generalists with Preference Trees.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Free Process Rewards without Process Labels.
CoRR, 2024

Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity.
CoRR, 2024

Advancing LLM Reasoning Generalists with Preference Trees.
CoRR, 2024

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment.
CoRR, 2024

Noise Contrastive Alignment of Language Models with Explicit Rewards.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Executable Code Actions Elicit Better LLM Agents.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Deep Clustering and Visualization for End-to-End High-Dimensional Data Analysis.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Removing Backdoors in Pre-trained Models by Regularized Continual Pre-training.
Trans. Assoc. Comput. Linguistics, 2023

Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown.
CoRR, 2023

UltraFeedback: Boosting Language Models with High-quality Feedback.
CoRR, 2023

Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations.
CoRR, 2023

Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial Attack Framework.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

A Close Look into the Calibration of Pre-trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
