John Yang

Affiliations:
  • Stanford University, Department of Computer Science, Stanford, CA, USA
  • Princeton University, Department of Computer Science, Princeton, NJ, USA (2021 - 2023)


According to our database, John Yang authored at least 20 papers between 2022 and 2026.

Bibliography

2026
ProgramBench: Can Language Models Rebuild Programs From Scratch?
CoRR, May, 2026

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces.
CoRR, January, 2026

2025
CodeClash: Benchmarking Goal-Oriented Software Engineering.
CoRR, November, 2025

OpenThoughts: Data Recipes for Reasoning Models.
CoRR, June, 2025

LongCodeBench: Evaluating Coding LLMs at 1M Context Windows.
CoRR, May, 2025

SWE-smith: Scaling Data for Software Engineering Agents.
CoRR, April, 2025

MMTEB: Massive Multilingual Text Embedding Benchmark.
CoRR, February, 2025

SWE-smith: Scaling Data for Software Engineering Agents.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

EnIGMA: Interactive Tools Substantially Assist LM Agents in Finding Security Vulnerabilities.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Prompting Large Language Models to Tackle the Full Software Development Lifecycle: A Case Study.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent Collaboration.
CoRR, 2024

EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges.
CoRR, 2024

DevBench: A Comprehensive Benchmark for Software Development.
CoRR, 2024

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

SWE-bench: Can Language Models Resolve Real-world Github Issues?
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Disentangled Prompt Learning for Transferable, Multimodal, Few-Shot Image Classification.
Proceedings of the IEEE International Conference on Big Data, 2024

Referral Augmentation for Zero-Shot Information Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

