John Yang

Affiliations:

Stanford University, Department of Computer Science, Stanford, CA, USA
Princeton University, Department of Computer Science, Princeton, NJ, USA (2021 - 2023)

According to our database¹, John Yang authored at least 20 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

ProgramBench: Can Language Models Rebuild Programs From Scratch?

[BibT_eX]

[DOI]

CoRR, May, 2026

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces.

[BibT_eX]

[DOI]

Estefany Kelly Buchanan

Orfeas Menis-Mastromichalakis

Gabriel H. S. Dreiman

Shreyas Pimpalgaonkar

Arinbjörn Kolbeinsson

Jesse Hu

Christopher Michael Rytting

CoRR, January, 2026

2025

CodeClash: Benchmarking Goal-Oriented Software Engineering.

[BibT_eX]

[DOI]

CoRR, November, 2025

OpenThoughts: Data Recipes for Reasoning Models.

[BibT_eX]

[DOI]

Shreyas Pimpalgaonkar

Maheswaran Sathiamoorthy

Alexandros G. Dimakis

Ludwig Schmidt

CoRR, June, 2025

LongCodeBench: Evaluating Coding LLMs at 1M Context Windows.

[BibT_eX]

[DOI]

CoRR, May, 2025

SWE-smith: Scaling Data for Software Engineering Agents.

[BibT_eX]

[DOI]

CoRR, April, 2025

MMTEB: Massive Multilingual Text Embedding Benchmark.

[BibT_eX]

[DOI]

Kenneth C. Enevoldsen

Hippolyte Gisserot-Boukhlef

Lester James V. Miranda

CoRR, February, 2025

SWE-smith: Scaling Data for Software Engineering Agents.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

EnIGMA: Interactive Tools Substantially Assist LM Agents in Finding Security Vulnerabilities.

[BibT_eX]

[DOI]

Prashanth Krishnamurthy

Brendan Dolan-Gavitt

Muhammad Shafique

Karthik R. Narasimhan

Ramesh Karri

Ofir Press

Proceedings of the Forty-second International Conference on Machine Learning, 2025

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

[BibT_eX]

[DOI]

Karthik R. Narasimhan

Diyi Yang

Sida Wang

Ofir Press

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Prompting Large Language Models to Tackle the Full Software Development Lifecycle: A Case Study.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024

Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent Collaboration.

[BibT_eX]

[DOI]

CoRR, 2024

EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges.

[BibT_eX]

[DOI]

Prashanth Krishnamurthy

CoRR, 2024

DevBench: A Comprehensive Benchmark for Software Development.

[BibT_eX]

[DOI]

CoRR, 2024

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

SWE-bench: Can Language Models Resolve Real-world Github Issues?

[BibT_eX]

[DOI]

Karthik R. Narasimhan

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Disentangled Prompt Learning for Transferable, Multimodal, Few-Shot Image Classification.

[BibT_eX]

[DOI]

John Yang

Alessandro Magnani

Binwei Yang

Proceedings of the IEEE International Conference on Big Data, 2024

Referral Augmentation for Zero-Shot Information Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

John Yang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...