Peirong Zhang

ORCID: 0000-0002-1857-5473

Affiliations:
  • South China University of Technology, Guangzhou, China


According to our database, Peirong Zhang authored at least 22 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography.
CoRR, January, 2026

DocAligner: Automating the annotation of photographed documents through real-virtual alignment.
Pattern Recognit., 2026

Frequency Mining Empowered by Text Aggregation: A New Perspective on Document Image Tampering Detection.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Smaller But Better: Unifying Layout Generation with Smaller Large Language Models.
Int. J. Comput. Vis., July, 2025

Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR.
CoRR, July, 2025

Privacy-Preserving Biometric Verification With Handwritten Random Digit String.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2025

Enhancing document dewarping evaluation: A new metric with improved accuracy and efficiency.
Pattern Recognit. Lett., 2025

HierCode: A lightweight hierarchical codebook for zero-shot Chinese text recognition.
Pattern Recognit., 2025

MegaHan97K: A large-scale dataset for mega-category Chinese character recognition with over 97K categories.
Pattern Recognit., 2025

Towards Real-World Document Specular Highlight Removal: The DocHighlight Dataset and DocSHRNet Method.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Capturing More: Learning Multi-Domain Representations for Robust Online Handwriting Verification.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Generalizable Audio Deepfake Detection via Risk-Aware Style Alignment and Structural Empirical Risk Minimization.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

From Pixels to Semantics: A Novel MLLM-Driven Approach for Explainable Tampered Text Detection.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

TongGu-VL: Advancing Visual-Language Understanding in Chinese Classical Studies through Parameter Sensitivity-Guided Instruction Tuning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Generalizable Audio Deepfake Detection via Hierarchical Structure Learning and Feature Whitening in Poincaré sphere.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Online Writer Retrieval With Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach.
IEEE Trans. Inf. Forensics Secur., 2024

TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
M⁶Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
RSTC: A New Residual Swin Transformer for Offline Word-Level Writer Identification.
IEEE Access, 2022

