Qi Zheng

Affiliations:
  • Alibaba Group, Hangzhou, China


According to our database1, Qi Zheng authored at least 18 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
LORE++: Logical location regression network for table structure recognition with pre-training.
Pattern Recognit., 2025

A Simple yet Effective Layout Token in Large Language Models for Document Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding.
CoRR, 2024

DocHieNet: A Large and Diverse Dataset for Document Hierarchy Parsing.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Vision Grid Transformer for Document Layout Analysis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

GEM: Gestalt Enhanced Markup Language Model for Web Understanding via Render Tree.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GeoLayoutLM: Geometric Pre-training for Visual Information Extraction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LORE: Logical Location Regression Network for Table Structure Recognition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding.
CoRR, 2022

2021
Decoupling Visual-Semantic Feature Learning for Robust Scene Text Recognition.
CoRR, 2021

SentiPrompt: Sentiment Knowledge Enhanced Prompt-Tuning for Aspect-Based Sentiment Analysis.
CoRR, 2021

2020
An End-to-End OCR Text Re-organization Sequence Learning for Rich-Text Detail Image Comprehension.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
SegLink++: Detecting Dense and Arbitrary-shaped Scene Text by Instance-aware Component Grouping.
Pattern Recognit., 2019

2018
ICPR2018 Contest on Robust Reading for Multi-Type Web Images.
Proceedings of the 24th International Conference on Pattern Recognition, 2018


  Loading...