Haon Park
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Eliciting and Analyzing Emergent Misalignment in State-of-the-Art Large Language Models.
CoRR, August, 2025
When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs.
CoRR, August, 2025
One-Shot is Enough: Consolidating Multi-Turn Attacks into Efficient Single-Turn Prompts for LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025