We stand with Ukraine

We stand with Ukraine

Peirong Zhang

Orcid: 0000-0002-1857-5473

Affiliations:

South China University of Technology, Guangzhou, China

According to our database¹, Peirong Zhang authored at least 23 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2026

PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography.

[DOI]

,

,

,

,

,

,

,

CoRR, January, 2026

DocAligner: Automating the annotation of photographed documents through real-virtual alignment.

[DOI]

,

,

,

,

,

,

Pattern Recognit., 2026

Draft, Verify, Restore: Self-Refining Historical Inscription Restoration with a Unified MLLM.

[DOI]

,

,

,

,

,

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Frequency Mining Empowered by Text Aggregation: A New Perspective on Document Image Tampering Detection.

[DOI]

,

,

,

,

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography.

[DOI]

,

,

,

,

,

,

,

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

Smaller But Better: Unifying Layout Generation with Smaller Large Language Models.

[DOI]

,

,

,

,

Int. J. Comput. Vis., July, 2025

Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR.

[DOI]

,

,

,

,

,

,

,

,

CoRR, July, 2025

Privacy-Preserving Biometric Verification With Handwritten Random Digit String.

[DOI]

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., April, 2025

Enhancing document dewarping evaluation: A new metric with improved accuracy and efficiency.

[DOI]

,

,

,

,

Pattern Recognit. Lett., 2025

HierCode: A lightweight hierarchical codebook for zero-shot Chinese text recognition.

[DOI]

,

,

,

,

,

,

,

Pattern Recognit., 2025

MegaHan97K: A large-scale dataset for mega-category Chinese character recognition with over 97K categories.

[DOI]

,

,

,

,

,

Pattern Recognit., 2025

Towards Real-World Document Specular Highlight Removal: The DocHighlight Dataset and DocSHRNet Method.

[DOI]

,

,

,

,

,

Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Capturing More: Learning Multi-Domain Representations for Robust Online Handwriting Verification.

[DOI]

,

,

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Generalizable Audio Deepfake Detection via Risk-Aware Style Alignment and Structural Empirical Risk Minimization.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

From Pixels to Semantics: A Novel MLLM-Driven Approach for Explainable Tampered Text Detection.

[DOI]

,

,

,

,

,

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

TongGu-VL: Advancing Visual-Language Understanding in Chinese Classical Studies through Parameter Sensitivity-Guided Instruction Tuning.

[DOI]

,

,

,

,

,

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Generalizable Audio Deepfake Detection via Hierarchical Structure Learning and Feature Whitening in Poincaré sphere.

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration.

[DOI]

,

,

,

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Online Writer Retrieval With Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach.

[DOI]

,

IEEE Trans. Inf. Forensics Secur., 2024

TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models.

[DOI]

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks.

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

M<sup>6</sup>Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

RSTC: A New Residual Swin Transformer for Offline Word-Level Writer Identification.

[DOI]

IEEE Access, 2022

Loading...