Zaiyuan Wang
According to our database1,
Zaiyuan Wang authored at least 10 papers
between 2025 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
WorldTravel: A Realistic Multimodal Travel-Planning Benchmark with Tightly Coupled Constraints.
CoRR, February, 2026
2025
LLM Swiss Round: Aggregating Multi-Benchmark Performance via Competitive Swiss-System Dynamics.
CoRR, December, 2025
NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents.
CoRR, December, 2025
CoRR, November, 2025
FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning.
CoRR, September, 2025
Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?
CoRR, September, 2025
CoRR, August, 2025
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025