Kongcheng Zhang

According to our database1, Kongcheng Zhang authored at least 5 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning.
CoRR, June, 2025

SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data.
CoRR, May, 2025

A Survey of Direct Preference Optimization.
CoRR, March, 2025

Reasoning with Reinforced Functional Token Tuning.
CoRR, February, 2025

2024
Odyssey: Empowering Agents with Open-World Skills.
CoRR, 2024


  Loading...