Zhenwei Shao

ORCID: 0009-0005-3069-9347

According to our database, Zhenwei Shao authored at least 10 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

CGC: Compositional Grounded Contrast for Fine-Grained Multi-Image Understanding.

CoRR, April, 2026

2025

MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning.

CoRR, December, 2025

VideoARM: Agentic Reasoning over Hierarchical Memory for Long-Form Video Understanding.

CoRR, December, 2025

Prophet: Prompting Large Language Models With Complementary Answer Heuristics for Knowledge-Based Visual Question Answering.

IEEE Trans. Pattern Anal. Mach. Intell., August, 2025

Imp: Highly Capable Large Multimodal Models for Mobile Devices.

IEEE Trans. Multim., 2025

Benchmarking and Enhancing Geospatial Visual Reasoning Over Street Maps.

IEEE Trans. Geosci. Remote. Sens., 2025

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook.

Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

Growing a Twig to Accelerate Large Vision-Language Models.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024

Imp: Highly Capable Large Multimodal Models for Mobile Devices.

CoRR, 2024

2023

Prompting Large Language Models with Answer Heuristics for Knowledge-Based Visual Question Answering.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
