An-Lan Wang

Orcid: 0009-0002-5449-6438

According to our database1, An-Lan Wang authored at least 15 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering.
CoRR, February, 2026

TechCoach: Towards Technical-Point-Aware Descriptive Action Coaching.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
ChineseVideoBench: Benchmarking Multi-modal Large Models for Chinese Video Question Answering.
CoRR, November, 2025

Benchmarking Vision-Language Models on Chinese Ancient Documents: From OCR to Knowledge Reasoning.
CoRR, September, 2025

WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?
CoRR, May, 2025

Task-Oriented 6-DoF Grasp Pose Detection in Clutters.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Advancing Sequential Numerical Prediction in Autoregressive Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

ParGo: Bridging Vision-Language with Partial and Global Views.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching.
CoRR, 2024

MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark.
CoRR, 2024

EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Event-Guided Procedure Planning from Instructional Videos with Text Supervision.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023


  Loading...