Wei Li

Affiliations:

East China Normal University, Shanghai, China
Shanghai AI Laboratory, China (2022 - 2025)
Shanghai Jiao Tong University, China (former)

According to our database, Wei Li authored at least 38 papers between 2016 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

MMRareBench: A Rare-Disease Multimodal and Multi-Image Medical Benchmark.

CoRR, April, 2026

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale.

CoRR, April, 2026

RAR: Retrieving and Ranking Augmented MLLMs for Visual Recognition.

IEEE Trans. Image Process., 2026

GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and a Comprehensive Multimodal Dataset Towards General Medical AI.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification.

CoRR, December, 2025

UniMedVL: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis.

CoRR, October, 2025

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing.

CoRR, September, 2025

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers.

CoRR, August, 2025

F²TTA: Free-Form Test-Time Adaptation on Cross-Domain Medical Image Classification via Image-Level Disentangled Prompt Tuning.

CoRR, July, 2025

MedITok: A Unified Tokenizer for Medical Image Synthesis and Interpretation.

CoRR, May, 2025

Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model.

CoRR, May, 2025

WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages.

CoRR, January, 2025

MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group Relative Policy Optimization.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

RetinaLogos: Fine-Grained Synthesis of High-Resolution Retinal Images Through Captions.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.

et al.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

OpenHuEval: Evaluating Large Language Model on Hungarian Specifics.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions.

CoRR, 2024

GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.

CoRR, 2024

MinerU: An Open-Source Solution for Precise Document Content Extraction.

CoRR, 2024

OpenDataLab: Empowering General Artificial Intelligence with Open Datasets.

CoRR, 2024

Investigating Public Fine-Tuning Datasets: A Complex Review of Current Practices from a Construction Perspective.

Runyuan Ma, Wei Li, Fukai Shang

CoRR, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output.

CoRR, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.

CoRR, 2024

FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models.

CoRR, 2024

InternLM2 Technical Report.

et al.

CoRR, 2024

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset.

CoRR, 2024

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model.

CoRR, 2024

How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites.

Sci. China Inf. Sci., 2024

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

VIGC: Visual Instruction Generation and Correction.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition.

CoRR, 2023

MiChao-HuaFen 1.0: A Specialized Pre-trained Corpus Dataset for Domain-specific Large Models.

CoRR, 2023

WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large Models.

CoRR, 2023

2016

CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016.

CoRR, 2016
