Kongcheng Zhang

According to our database, Kongcheng Zhang authored at least 10 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs.
CoRR, March, 2026

2025
Replay Failures as Successes: Sample-Efficient Reinforcement Learning for Instruction Following.
CoRR, December, 2025

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning.
CoRR, August, 2025

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning.
CoRR, June, 2025

SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data.
CoRR, May, 2025

A Survey of Direct Preference Optimization.
CoRR, March, 2025

Reasoning with Reinforced Functional Token Tuning.
CoRR, February, 2025

Odyssey: Empowering Minecraft Agents with Open-World Skills.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Odyssey: Empowering Agents with Open-World Skills.
CoRR, 2024

