Ruofei Zhu

According to our database1, Ruofei Zhu authored at least 9 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Truncated Proximal Policy Optimization.
CoRR, June, 2025

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning.
CoRR, April, 2025

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks.
CoRR, April, 2025

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback.
CoRR, March, 2025

DAPO: An Open-Source LLM Reinforcement Learning System at Scale.
CoRR, March, 2025

What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret.
CoRR, March, 2025

2020
A Learning Resource Recommendation Model Based on Fusion of Sequential Information.
Proceedings of the Artificial Intelligence and Security - 6th International Conference, 2020

2019
A learning resource recommendation algorithm based on online learning sequential behavior.
Int. J. Wavelets Multiresolution Inf. Process., 2019

2018
A Novel Learning Early-Warning Model Based on Random Forest Algorithm.
Proceedings of the Intelligent Tutoring Systems - 14th International Conference, 2018


  Loading...