Tao Yu

ORCID: 0000-0001-9939-2216

Affiliations:

University of Hong Kong, Department of Computer Science, Hong Kong
University of Washington, Paul G. Allen School of Computer Science & Engineering, Seattle, WA, USA
Yale University, Department of Computer Science, New Haven, CT, USA (PhD)

According to our database, Tao Yu authored at least 74 papers between 2016 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents.

CoRR, May, 2026

CUBE: A Standard for Unifying Agent Benchmarks.

CoRR, March, 2026

2025

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration.

CoRR, November, 2025

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents.

CoRR, October, 2025

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos.

CoRR, October, 2025

OpenCUA: Open Foundations for Computer-Use Agents.

CoRR, August, 2025

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis.

CoRR, May, 2025

Keypoint-Based Registration for Surgical Tool Trajectory Guidance and Intraoperative Risk Alerts.

Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2025

OpenCUA: Open Foundations for Computer-Use Agents.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Agentic AI for Enterprise: Emerging Applications and Real-world Challenges.

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Learn-by-interact: A Data-Centric Framework For Self-Adaptive Agents in Realistic Environments.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Generative Representational Instruction Tuning.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Attacking Vision-Language Computer Agents via Pop-ups.

Yanzhe Zhang

Tao Yu

Diyi Yang

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

ARKS: Active Retrieval in Knowledge Soup for Code Generation.

CoRR, 2024

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement.

CoRR, 2024

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Lemur: Harmonizing Natural Language and Code for Language Agents.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

EvoR: Evolving Retrieval for Code Generation.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

FOLIO: Natural Language Reasoning with First-Order Logic.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Language Agents: Foundations, Prospects, and Risks.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

Bartlomiej Bojanowski

Christopher D. Manning

Daniel Moseguí González

Eunice Engefu Manyasi

Evgenii Zheltonozhskii

Fanyue Xia

Fatemeh Siar

Fernando Martínez-Plumed

Giambattista Parascandolo

Giorgio Mariani

Gloria Wang

Gonzalo Jaimovitch-López

Jaime Fernández Fisac

Jascha Sohl-Dickstein

José Hernández-Orallo

Karthik Gopalakrishnan

Lidia Contreras Ochando

Louis-Philippe Morency

María José Ramírez-Quintana

Michael I. Ivanitskiy

Neta Gur-Ari Krakover

Nitish Shirish Keskar

Pablo Antonio Moreno Casares

Pegah Alipoormolabashi

Shyamolima (Shammie) Debnath

Sneha Priscilla Makini

Yadollah Yaghoobzadeh

Trans. Mach. Learn. Res., 2023

OpenAgents: An Open Platform for Language Agents in the Wild.

CoRR, 2023

Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning.

CoRR, 2023

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations.

CoRR, 2023

Automated Self-Supervised Learning for Recommendation.

Proceedings of the ACM Web Conference 2023, 2023

Coder Reviewer Reranking for Code Generation.

Proceedings of the International Conference on Machine Learning, 2023

Compositional Exemplars for In-context Learning.

Proceedings of the International Conference on Machine Learning, 2023

DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation.

Proceedings of the International Conference on Machine Learning, 2023

Selective Annotation Makes Language Models Better Few-Shot Learners.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Binding Language Models in Symbolic Languages.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Generating Data for Symbolic Language with Large Language Models.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Batch Prompting: Efficient Inference with Large Language Model APIs.

Zhoujun Cheng

Jungo Kasai

Tao Yu

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

Complex Reasoning in Natural Language.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 6: Tutorial Abstracts), 2023

One Embedder, Any Task: Instruction-Finetuned Text Embeddings.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

When Geometric Deep Learning Meets Pretrained Protein Language Models.

CoRR, 2022

NL2INTERFACE: Interactive Visualization Interface Generation from Natural Language Queries.

CoRR, 2022

FOLIO: Natural Language Reasoning with First-Order Logic.

CoRR, 2022

ZeroGen: Efficient Zero-shot Learning via Dataset Generation.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

In-Context Learning for Few-Shot Dialogue State Tracking.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

DYLE: Dynamic Latent Extraction for Abstractive Long-Input Summarization.

Ahmed Hassan Awadallah

Dragomir R. Radev

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Prefix-to-SQL: Text-to-SQL Generation from Incomplete User Questions.

CoRR, 2021

End-to-End Cross-Domain Text-to-SQL Semantic Parsing with Auxiliary Task.

CoRR, 2021

QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization.

Ahmed Hassan Awadallah

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

DART: Open-Domain Structured Data Record to Text Generation.

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing.

Ahmed Hassan Awadallah

Proceedings of the 9th International Conference on Learning Representations, 2021

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing.

Proceedings of the 9th International Conference on Learning Representations, 2021

Testing Cross-Database Semantic Parsers With Canonical Utterances.

Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021

Effective Fine-Tuning Methods for Cross-lingual Adaptation.

Tao Yu

Shafiq R. Joty

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

SummerTime: Text Summarization Toolkit for Non-experts.

Ahmed Hassan Awadallah

Dragomir R. Radev

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021

An Exploratory Study on Long Dialogue Summarization: What Works and What's Next.

Ahmed Hassan Awadallah

Dragomir R. Radev

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Logic-Consistency Text Generation from Semantic Parses.

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

Did You Ask a Good Question? A Cross-Domain Question Intention Classification Benchmark for Text-to-SQL.

CoRR, 2020

DART: Open-Domain Structured Data Record to Text Generation.

CoRR, 2020

Semantic Evaluation for Text-to-SQL with Distilled Test Suites.

Ruiqi Zhong

Tao Yu

Dan Klein

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Online Conversation Disentanglement with Pointer Networks.

Tao Yu

Shafiq R. Joty

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019

Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

SParC: Cross-Domain Semantic Parsing in Context.
