Hao Feng
Orcid: 0000-0001-8127-6639Affiliations:
- University of Science and Technology of China, Department of Electronic Engineering and Information Science, CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, Hefei, China
According to our database1,
Hao Feng authored at least 45 papers
between 2021 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2026
ACM Trans. Multim. Comput. Commun. Appl., April, 2026
TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering.
CoRR, February, 2026
CoRR, February, 2026
CoRR, January, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
ChineseVideoBench: Benchmarking Multi-modal Large Models for Chinese Video Question Answering.
CoRR, November, 2025
IEEE Trans. Circuits Syst. Video Technol., September, 2025
Benchmarking Vision-Language Models on Chinese Ancient Documents: From OCR to Knowledge Reasoning.
CoRR, September, 2025
Int. J. Comput. Vis., August, 2025
Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning.
CoRR, May, 2025
WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?
CoRR, May, 2025
CoRR, March, 2025
IEEE Trans. Multim., 2025
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
A Bounding Box is Worth One Token - Interleaving Layout and Text in a Large Language Model for Document Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
IEEE Trans. Circuits Syst. Video Technol., September, 2024
IEEE Trans. Circuits Syst. Video Technol., June, 2024
Comput. Vis. Image Underst., January, 2024
CoRR, 2024
RoFIR: Robust Fisheye Image Rectification Framework Impervious to Optical Center Deviation.
CoRR, 2024
CoRR, 2024
DocPedia: unleashing the power of large multimodal model in the frequency domain for versatile document understanding.
Sci. China Inf. Sci., 2024
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024
2023
IEEE Trans. Image Process., 2023
Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs.
CoRR, 2023
DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding.
CoRR, 2023
UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding.
CoRR, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
2022
PolyTracker: Progressive Contour Regression for Multiple Object Tracking and Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
2021
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021