Kongcheng Zhang

According to our database¹, Kongcheng Zhang authored at least 10 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs.

[BibT_eX]

[DOI]

CoRR, March, 2026

2025

Replay Failures as Successes: Sample-Efficient Reinforcement Learning for Instruction Following.

[BibT_eX]

[DOI]

CoRR, December, 2025

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning.

[BibT_eX]

[DOI]

CoRR, August, 2025

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning.

[BibT_eX]

[DOI]

CoRR, June, 2025

A Survey of Direct Preference Optimization.

[BibT_eX]

[DOI]

CoRR, March, 2025

Reasoning with Reinforced Functional Token Tuning.

[BibT_eX]

[DOI]

CoRR, February, 2025

SeRL: Self-play Reinforcement Learning for Large Language Models with Limited Data.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Odyssey : Empowering Minecraft Agents with Open-World Skills.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024

Odyssey: Empowering Agents with Open-World Skills.

[BibT_eX]

[DOI]

CoRR, 2024

Kongcheng Zhang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...