Qintong Zhang

According to our database¹, Qintong Zhang authored at least 14 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2026

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale.

CoRR, April, 2026

PEARL: Personalized Streaming Video Understanding Model.

CoRR, March, 2026

BrowseComp-V³: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents.

CoRR, February, 2026

Exploring Information Seeking Agent Consolidation.

CoRR, February, 2026

DocDancer: Towards Agentic Document-Grounded Information Seeking.

CoRR, January, 2026

2025

DOCR-Inspector: Fine-Grained and Automated Evaluation of Document Parsing with VLM.

CoRR, December, 2025

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition.

CoRR, December, 2025

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation.

CoRR, October, 2025

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing.

CoRR, September, 2025

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Stop Looking for "Important Tokens" in Multimodal Language Models: Duplication Matters More.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024

Enhanced Dual-Channel Model-Based with Improved Unet++ Network for Landslide Monitoring and Region Extraction in Remote Sensing Images.

Remote. Sens., August, 2024

Testing homogeneity in high dimensional data through random projections.

J. Multivar. Anal., March, 2024

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction.

Qintong Zhang

Victor Shea-Jay Huang

CoRR, 2024
