Summer Yue

According to our database¹, Summer Yue authored at least 18 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Best Practices for Biorisk Evaluations on Open-Weight Bio-Foundation Models.

[BibT_eX]

[DOI]

Boyi Wei

Zora Che

Nathaniel Li

Udari Madhushani Sehwag

CoRR, October, 2025

Remote Labor Index: Measuring AI Automation of Remote Work.

[BibT_eX]

[DOI]

CoRR, October, 2025

The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems.

[BibT_eX]

[DOI]

CoRR, March, 2025

Jailbreaking to Jailbreak.

[BibT_eX]

[DOI]

CoRR, February, 2025

EnigmaEval: A Benchmark of Long Multimodal Reasoning Challenges.

[BibT_eX]

[DOI]

CoRR, February, 2025

Planning in Natural Language Improves LLM Search for Code Generation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Aligned LLMs Are Not Aligned Browser Agents.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs.

[BibT_eX]

[DOI]

Kaustubh Deshpande

Ved Sirdeshmukh

Johannes Baptist Mols

Lifeng Jin

Ed-Yeremai Hernandez-Cardona

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents.

[BibT_eX]

[DOI]

CoRR, 2024

Planning In Natural Language Improves LLM Search For Code Generation.

[BibT_eX]

[DOI]

CoRR, 2024

LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet.

[BibT_eX]

[DOI]

CoRR, 2024

A Careful Examination of Large Language Model Performance on Grade School Arithmetic.

[BibT_eX]

[DOI]

CoRR, 2024

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning.

[BibT_eX]

[DOI]

Ann-Kathrin Dombrowski

Justin Tienken-Harder

Kallol Krishna Karmakar

Steven Basart

Stephen Fitz

Mindy Levine

Ponnurangam Kumaraguru

CoRR, 2024

A Careful Examination of Large Language Model Performance on Grade School Arithmetic.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

The WMDP Benchmark: Measuring and Reducing Malicious Use with Unlearning.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2022

Scalability and Generalization of Circuit Training for Chip Floorplanning.

[BibT_eX]

[DOI]

Proceedings of the ISPD 2022: International Symposium on Physical Design, Virtual Event, Canada, March 27, 2022

Differentiable Architecture Search for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Automated Machine Learning, 2022

2021

RL-DARTS: Differentiable Architecture Search for Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Summer Yue

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...