Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

OS-ATLAS: Foundation Action Model for Generalist GUI Agents.

Zhiyong Wu

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

CapArena: Benchmarking and Analyzing Detailed Image Captioning in the LLM Era.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond.

CoRR, 2024

SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model.

CoRR, 2023

Food-500 Cap: A Fine-Grained Food Caption Benchmark for Evaluating Vision-Language Models.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022

ADS-Cap: A Framework for Accurate and Diverse Stylized Captioning with Unpaired Stylistic Corpora.

Proceedings of the Natural Language Processing and Chinese Computing, 2022

Kanzhi Cheng
