Peirong Zhang
ORCID: 0000-0002-1857-5473
Affiliations:
- South China University of Technology, Guangzhou, China
According to our database, Peirong Zhang authored at least 22 papers between 2022 and 2026.
Links
Online presence:
- on orcid.org
Bibliography
2026
PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography.
CoRR, January, 2026
DocAligner: Automating the annotation of photographed documents through real-virtual alignment.
Pattern Recognit., 2026
Frequency Mining Empowered by Text Aggregation: A New Perspective on Document Image Tampering Detection.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Based Scalable Typography.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
Int. J. Comput. Vis., July, 2025
Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR.
CoRR, July, 2025
IEEE Trans. Pattern Anal. Mach. Intell., April, 2025
Enhancing document dewarping evaluation: A new metric with improved accuracy and efficiency.
Pattern Recognit. Lett., 2025
HierCode: A lightweight hierarchical codebook for zero-shot Chinese text recognition.
Pattern Recognit., 2025
MegaHan97K: A large-scale dataset for mega-category Chinese character recognition with over 97K categories.
Pattern Recognit., 2025
Towards Real-World Document Specular Highlight Removal: The DocHighlight Dataset and DocSHRNet Method.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025
Capturing More: Learning Multi-Domain Representations for Robust Online Handwriting Verification.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Generalizable Audio Deepfake Detection via Risk-Aware Style Alignment and Structural Empirical Risk Minimization.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
From Pixels to Semantics: A Novel MLLM-Driven Approach for Explainable Tampered Text Detection.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
TongGu-VL: Advancing Visual-Language Understanding in Chinese Classical Studies through Parameter Sensitivity-Guided Instruction Tuning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Generalizable Audio Deepfake Detection via Hierarchical Structure Learning and Feature Whitening in Poincaré sphere.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025
Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
Online Writer Retrieval With Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach.
IEEE Trans. Inf. Forensics Secur., 2024
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
M⁶Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
IEEE Access, 2022