Jiapeng Wang

Orcid: 0000-0002-2060-3488

Affiliations:
  • South China University of Technology, Guangzhou, China


According to our database, Jiapeng Wang authored at least 17 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
LiLTv2: Language-substitutable Layout-image Transformer for Visual Information Extraction.
ACM Trans. Multim. Comput. Commun. Appl., March, 2025

Hallucination-Aware Prompt Optimization for Text-to-Video Synthesis.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding.
CoRR, 2024

PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

PPTSER: A Plug-and-Play Tag-guided Method for Few-shot Semantic Entity Recognition on Visually-rich Documents.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval.
CoRR, 2023

Towards Better Translations from Classical to Modern Chinese: A New Dataset and a New Method.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, 2023

2022
ChaCo: Character Contrastive Learning for Handwritten Text Recognition.
Proceedings of the Frontiers in Handwriting Recognition - 18th International Conference, 2022

LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

CMT-Co: Contrastive Learning with Character Movement Task for Handwritten Text Recognition.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Improving Machine Understanding of Human Intent in Charts.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

2020
Precise detection of Chinese characters in historical documents with deep reinforcement learning.
Pattern Recognit., 2020

