Fanyi Pu

Orcid: 0009-0004-5103-4347

According to our database1, Fanyi Pu authored at least 9 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Demystifing Video Reasoning.
CoRR, March, 2026

2025
Scaling Spatial Intelligence with Multimodal Foundation Models.
CoRR, November, 2025

Otter: A Multi-Modal Model With In-Context Instruction Tuning.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2025

Memory-Efficient LLM Training by Various-Grained Low-Rank Projection of Gradients.
CoRR, May, 2025

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos.
CoRR, January, 2025

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

2024
WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning.
CoRR, 2024

2023
OtterHD: A High-Resolution Multi-modality Model.
CoRR, 2023

MIMIC-IT: Multi-Modal In-Context Instruction Tuning.
CoRR, 2023


  Loading...