Yang Shi
Orcid: 0009-0003-9241-236XAffiliations:
- Peking University, Beijing, China
According to our database1,
Yang Shi authored at least 26 papers
between 2025 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2026
CoRR, April, 2026
CoRR, April, 2026
VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining.
CoRR, March, 2026
OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models.
CoRR, February, 2026
Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks.
CoRR, February, 2026
DiaDem: Advancing Dialogue Descriptions in Audiovisual Video Captioning for Multimodal Large Language Models.
CoRR, January, 2026
CoRR, January, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models.
CoRR, December, 2025
CoRR, December, 2025
Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling.
CoRR, December, 2025
The Unseen Bias: How Norm Discrepancy in Pre-Norm MLLMs Leads to Visual Information Loss.
CoRR, December, 2025
CoRR, November, 2025
When Modalities Conflict: How Unimodal Reasoning Uncertainty Governs Preference Dynamics in MLLMs.
CoRR, November, 2025
CoRR, October, 2025
CoRR, October, 2025
CoRR, September, 2025
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark.
CoRR, September, 2025
VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks.
CoRR, June, 2025
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios.
CoRR, May, 2025
MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models.
CoRR, April, 2025
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025