Runming He
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions.
CoRR, June, 2025
LogicPuzzleRL: Cultivating Robust Mathematical Reasoning in LLMs via Reinforcement Learning.
CoRR, June, 2025