Min Dou
According to our database, Min Dou authored at least 23 papers between 2022 and 2025.
Bibliography
2025
InternSpatial: A Comprehensive Dataset for Spatial Reasoning in Vision-Language Models.
CoRR, June, 2025
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models.
CoRR, April, 2025
CoRR, January, 2025
2024
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling.
CoRR, 2024
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes.
CoRR, 2024
CoRR, 2024
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models.
CoRR, 2024
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.
CoRR, 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond.
CoRR, 2024
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning.
CoRR, 2024
OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving.
CoRR, 2024
How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites.
Sci. China Inf. Sci., 2024
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2024
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
LimSim++: A Closed-Loop Platform for Deploying Multimodal LLMs in Autonomous Driving.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Remote. Sens., June, 2023
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving.
CoRR, 2023
Proceedings of the 26th IEEE International Conference on Intelligent Transportation Systems, 2023
2022
ADAS: A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation.
CoRR, 2022