Wanjia Zhao
According to our database1,
Wanjia Zhao authored at least 16 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning.
CoRR, May, 2026
ZEBRAARENA: A Diagnostic Simulation Environment for Studying Reasoning-Action Coupling in Tool-Augmented LLMs.
CoRR, March, 2026
CoRR, February, 2026
2025
CoRR, October, 2025
CoRR, July, 2025
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition.
CoRR, April, 2025
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
2024
CoRR, 2024
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search.
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
2023
Positive Distribution Pollution: Rethinking Positive Unlabeled Learning from a Unified Perspective.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023