Haon Park
Bibliography
2025
CoRR, September, 2025
X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates.
CoRR, September, 2025
ObjexMT: Objective Extraction and Metacognitive Calibration for LLM-as-a-Judge under Multi-Turn Jailbreaks.
CoRR, August, 2025
Eliciting and Analyzing Emergent Misalignment in State-of-the-Art Large Language Models.
CoRR, August, 2025
When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs.
CoRR, August, 2025
One-Shot is Enough: Consolidating Multi-Turn Attacks into Efficient Single-Turn Prompts for LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025