Qintong Zhang

According to our database1, Qintong Zhang authored at least 14 papers between 2024 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale.
CoRR, April, 2026

PEARL: Personalized Streaming Video Understanding Model.
CoRR, March, 2026

BrowseComp-V<sup>3</sup>: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents.
CoRR, February, 2026

Exploring Information Seeking Agent Consolidation.
CoRR, February, 2026

DocDancer: Towards Agentic Document-Grounded Information Seeking.
CoRR, January, 2026

2025
DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM.
CoRR, December, 2025

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition.
CoRR, December, 2025

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation.
CoRR, October, 2025

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing.
CoRR, September, 2025

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Stop Looking for "Important Tokens" in Multimodal Language Models: Duplication Matters More.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Enhanced Dual-Channel Model-Based with Improved Unet++ Network for Landslide Monitoring and Region Extraction in Remote Sensing Images.
Remote. Sens., August, 2024

Testing homogeneity in high dimensional data through random projections.
J. Multivar. Anal., March, 2024

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction.
CoRR, 2024


  Loading...