Huanjin Yao
According to our database1,
Huanjin Yao authored at least 19 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
CoRR, March, 2026
Advancing Multimodal Judge Models through a Capability-Oriented Benchmark and MCTS-Driven Data Generation.
CoRR, March, 2026
CoLoGen: Progressive Learning of Concept-Localization Duality for Unified Image Generation.
CoRR, February, 2026
R1-SyntheticVL: Is Synthetic Data from Generative Models Ready for Multimodal Large Language Model?
CoRR, February, 2026
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation.
CoRR, January, 2026
2025
CoRR, October, 2025
MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI.
CoRR, June, 2025
CoRR, May, 2025
R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO.
CoRR, May, 2025
Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation.
CoRR, January, 2025
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-Wise Group Relative Policy Optimization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
2024
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search.
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
2023
Side4Video: Spatial-Temporal Side Network for Memory-Efficient Image-to-Video Transfer Learning.
CoRR, 2023