Zhangwei Gao

According to our database1, Zhangwei Gao authored at least 14 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Intern-S1: A Scientific Multimodal Foundation Model.
CoRR, August, 2025

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models.
CoRR, April, 2025

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning.
CoRR, March, 2025

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Mini-InternVL: a flexible-transfer pocket multi-modal model with 5% parameters and 90% performance.
Vis. Intell., 2024

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling.
CoRR, 2024

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization.
CoRR, 2024

Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance.
CoRR, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.
CoRR, 2024

MMInstruct: a high-quality multi-modal instruction tuning dataset with extensive diversity.
Sci. China Inf. Sci., 2024

How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites.
Sci. China Inf. Sci., 2024

ControlLLM: Augment Language Models with Tools by Searching on Graphs.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
ControlLLM: Augment Language Models with Tools by Searching on Graphs.
CoRR, 2023

2022
A Fast Detection Method for Infrared Small Targets in Complex Sea and Sky Background.
Proceedings of the Asia Conference on Algorithms, Computing and Machine Learning, 2022


  Loading...