Summer Yue

According to our database1, Summer Yue authored at least 16 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems.
CoRR, March, 2025

Jailbreaking to Jailbreak.
CoRR, February, 2025

EnigmaEval: A Benchmark of Long Multimodal Reasoning Challenges.
CoRR, February, 2025

Planning in Natural Language Improves LLM Search for Code Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Aligned LLMs Are Not Aligned Browser Agents.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents.
CoRR, 2024

Planning In Natural Language Improves LLM Search For Code Generation.
CoRR, 2024

LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet.
CoRR, 2024

A Careful Examination of Large Language Model Performance on Grade School Arithmetic.
CoRR, 2024

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning.
CoRR, 2024

A Careful Examination of Large Language Model Performance on Grade School Arithmetic.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024


2022
Scalability and Generalization of Circuit Training for Chip Floorplanning.
Proceedings of the ISPD 2022: International Symposium on Physical Design, Virtual Event, Canada, March 27, 2022

Differentiable Architecture Search for Reinforcement Learning.
Proceedings of the International Conference on Automated Machine Learning, 2022

2021
RL-DARTS: Differentiable Architecture Search for Reinforcement Learning.
CoRR, 2021


  Loading...