Zhenwei Shao

Orcid: 0009-0005-3069-9347

According to our database1, Zhenwei Shao authored at least 10 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
CGC: Compositional Grounded Contrast for Fine-Grained Multi-Image Understanding.
CoRR, April, 2026

2025
MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning.
CoRR, December, 2025

VideoARM: Agentic Reasoning over Hierarchical Memory for Long-Form Video Understanding.
CoRR, December, 2025

Prophet: Prompting Large Language Models With Complementary Answer Heuristics for Knowledge-Based Visual Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2025

Imp: Highly Capable Large Multimodal Models for Mobile Devices.
IEEE Trans. Multim., 2025

Benchmarking and Enhancing Geospatial Visual Reasoning Over Street Maps.
IEEE Trans. Geosci. Remote. Sens., 2025

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

Growing a Twig to Accelerate Large Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
Imp: Highly Capable Large Multimodal Models for Mobile Devices.
CoRR, 2024

2023
Prompting Large Language Models with Answer Heuristics for Knowledge-Based Visual Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023


  Loading...