Zhiruo Wang

Orcid: 0000-0002-9839-8410

Affiliations:
  • Carnegie Mellon University, Language Technologies Institute, Pittsburgh, PA, USA
  • Beijing Normal University, School of Mathematical Sciences, China (former)


According to our database, Zhiruo Wang authored at least 40 papers between 2020 and 2026.

Bibliography

2026
How Well Does Agent Development Reflect Real-World Work?
CoRR, March, 2026

Modeling Distinct Human Interaction in Web Agents.
CoRR, February, 2026

Vision-as-Inverse-Graphics Agent via Interleaved Multimodal Reasoning.
CoRR, January, 2026

TRUST-CUA: Trustworthy Computer-Using Generalist Agents for Intelligent User Interfaces (Workshop).
Companion Proceedings of the 31st International Conference on Intelligent User Interfaces, 2026

2025
How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations.
CoRR, October, 2025

TOM-SWE: User Mental Modeling For Software Engineering Agents.
CoRR, October, 2025

ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory.
CoRR, October, 2025

OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety.
CoRR, July, 2025

Universal Retrieval for Multimodal Trajectory Modeling.
CoRR, June, 2025

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills.
CoRR, April, 2025

Inducing Programmatic Skills for Agentic Tasks.
CoRR, April, 2025

Towards Unobtrusive Physical AI: Augmenting Everyday Objects with Intelligence and Robotic Movement for Proactive Assistance.
Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology, 2025

CodeRAG-Bench: Can Retrieval Augment Code Generation?
Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Benchmarking Failures in Tool-Augmented Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Agent Workflow Memory.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

RAGGED: Towards Informed Design of Scalable and Stable RAG Systems.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

cAST: Enhancing Code Retrieval-Augmented Generation with Structural Chunking via Abstract Syntax Tree.
Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

AutoPresent: Designing Structured Visuals from Scratch.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks.
CoRR, 2024

What Are Tools Anyway? A Survey from the Language Model Perspective.
CoRR, 2024

RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems.
CoRR, 2024

Large Language Models for Tabular Data: Progresses and Future Directions.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
StarCoder: may the source be with you!
Trans. Mach. Learn. Res., 2023

Learning to Filter Context for Retrieval-Augmented Generation.
CoRR, 2023

Improving Factuality of Abstractive Summarization via Contrastive Reward Learning.
CoRR, 2023

SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs.
Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Execution-Based Evaluation for Open-Domain Code Generation.
Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

API-Assisted Code Generation for Question Answering on Varied Table Structures.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages.
Findings of the Association for Computational Linguistics: EACL 2023, 2023

2022
Table Retrieval May Not Necessitate Table-specific Model Design.
CoRR, 2022

Retrieval as Attention: End-to-end Learning of Retrieval and Reading within a Single Transformer.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
TUTA: Tree-based Transformers for Generally Structured Table Pre-training.
KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

2020
Intrinsic Knowledge Evaluation on Chinese Language Models.
CoRR, 2020

Structure-aware Pre-training for Table Understanding with Tree-based Transformers.
CoRR, 2020

FastBERT: a Self-distilling BERT with Adaptive Inference Time.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

K-BERT: Enabling Language Representation with Knowledge Graph.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

