Yang Shi
Affiliations:- Peking University, Beijing, China
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks.
CoRR, June, 2025
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios.
CoRR, May, 2025
CoRR, April, 2025
MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models.
CoRR, April, 2025