JG Yao
Bibliography
2025
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench.
CoRR, October, 2025
FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions.
CoRR, September, 2025