Yu Yue

According to our database, Yu Yue authored at least 14 papers between 2013 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Truncated Proximal Policy Optimization.
CoRR, June, 2025

PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier.
CoRR, June, 2025

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning.
CoRR, April, 2025

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks.
CoRR, April, 2025

A Unified Pairwise Framework for RLHF: Bridging Generative Reward Modeling and Policy Optimization.
CoRR, April, 2025

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback.
CoRR, March, 2025

DAPO: An Open-Source LLM Reinforcement Learning System at Scale.
CoRR, March, 2025

What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret.
CoRR, March, 2025

Multitarget Natural Compounds for Ischemic Stroke Treatment: Integration of Deep Learning Prediction and Experimental Validation.
J. Chem. Inf. Model., 2025

2024
A Survey on Natural Language Counterfactual Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

PairCFR: Enhancing Model Training on Paired Counterfactually Augmented Data through Contrastive Learning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Multi-scenario Learning MPC for Automated Driving in Unknown and Changing Environments.
Proceedings of the 21st IEEE International Conference on Industrial Informatics, 2023

2022
Time complexity analysis of quantum difference methods for the multiscale transport equations.
CoRR, 2022

2013
DAS: A dynamic assignment scheduling algorithm for stream computing in distributed applications.
Proceedings of the 2013 IEEE Global Communications Conference, 2013

