Jun Tang

Orcid: 0009-0006-3949-007X

Affiliations:
  • Alibaba Group, Beijing, China
  • Huazhong University of Science and Technology, School of Electronic Information and Communications, Wuhan, China


According to our database, Jun Tang authored at least 9 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models.
CoRR, February, 2025

Qwen2.5-VL Technical Report.
CoRR, February, 2025

2024
CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy.
CoRR, 2024

VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Platypus: A Generalized Specialist Model for Reading Text in Various Forms.
Proceedings of the Computer Vision - ECCV 2024, 2024

2022
Vision-Language Pre-Training for Boosting Scene Text Detectors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter.
CoRR, 2021

MOST: A Multi-Oriented Scene Text Detector With Localization Refinement.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2019
SegLink++: Detecting Dense and Arbitrary-shaped Scene Text by Instance-aware Component Grouping.
Pattern Recognit., 2019
