Qi Zheng

Affiliations:

Alibaba Group, Hangzhou, China

According to our database¹, Qi Zheng authored at least 19 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Bi-VLDoc: bidirectional vision-language modeling for visually-rich document understanding.

[BibT_eX]

[DOI]

Int. J. Document Anal. Recognit., December, 2025

Mobile-Agent-v3: Fundamental Agents for GUI Automation.

[BibT_eX]

[DOI]

CoRR, August, 2025

LORE++: Logical location regression network for table structure recognition with pre-training.

[BibT_eX]

[DOI]

Pattern Recognit., 2025

A Simple yet Effective Layout Token in Large Language Models for Document Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

DocHieNet: A Large and Diverse Dataset for Document Hierarchy Parsing.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Vision Grid Transformer for Document Layout Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

GEM: Gestalt Enhanced Markup Language Model for Web Understanding via Render Tree.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GeoLayoutLM: Geometric Pre-training for Visual Information Extraction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LORE: Logical Location Regression Network for Table Structure Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2021

Decoupling Visual-Semantic Feature Learning for Robust Scene Text Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

SentiPrompt: Sentiment Knowledge Enhanced Prompt-Tuning for Aspect-Based Sentiment Analysis.

[BibT_eX]

[DOI]

CoRR, 2021

2020

An End-to-End OCR Text Re-organization Sequence Learning for Rich-Text Detail Image Comprehension.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

SegLink++: Detecting Dense and Arbitrary-shaped Scene Text by Instance-aware Component Grouping.

[BibT_eX]

[DOI]

Pattern Recognit., 2019

2018

ICPR2018 Contest on Robust Reading for Multi-Type Web Images.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Qi Zheng

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...