Humen Zhong

Orcid: 0009-0002-8676-0811

According to our database1, Humen Zhong authored at least 8 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Qwen3-VL-Seg: Unlocking Open-World Referring Segmentation with Vision-Language Grounding.
CoRR, May, 2026

2025
Qwen2.5-VL Technical Report.
CoRR, February, 2025

CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Platypus: A Generalized Specialist Model for Reading Text in Various Forms.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Modeling Entities as Semantic Points for Visual Information Extraction in the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2021
ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter.
CoRR, 2021

MOST: A Multi-Oriented Scene Text Detector With Localization Refinement.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021


  Loading...