Xinhao Li
ORCID: 0009-0003-0382-3985
Affiliations:
- Nanjing University, State Key Laboratory for Novel Software Technology, Nanjing, China
- Shanghai AI Laboratory, OpenGVLab, Shanghai, China
According to our database, Xinhao Li authored at least 17 papers between 2023 and 2025.
Bibliography
2025
VideoChat-R1.5: Visual Test-Time Scaling to Reinforce Multimodal Reasoning by Iterative Perception.
CoRR, September, 2025
CoRR, June, 2025
CoRR, April, 2025
CoRR, January, 2025
CoRR, January, 2025
CoRR, January, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model.
CoRR, 2024
CoRR, 2024
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
2023
CoRR, 2023