Binghai Wang
According to our database1,
Binghai Wang authored at least 12 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
EVPO: Explained Variance Policy Optimization for Adaptive Critic Utilization in LLM Post-Training.
CoRR, April, 2026
MM-Doc-R1: Training Agents for Long Document Visual Question Answering through Multi-turn Reinforcement Learning.
CoRR, April, 2026
CoRR, March, 2026
CoRR, February, 2026
Proceedings of the ACM Web Conference 2026, 2026
2025
CoRR, November, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
2023