Haebin Seong
According to our database1,
Haebin Seong
authored at least 4 papers
between 2024 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
CoRR, May, 2025
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems.
CoRR, 2024