Zelin Tan
Orcid: 0000-0002-6855-6852Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Scaling Behaviors of LLM Reinforcement Learning Post-Training: An Empirical Study in Mathematical Reasoning.
CoRR, September, 2025
CoRR, September, 2025
ACM Trans. Asian Low Resour. Lang. Inf. Process., February, 2025