Nina Panickssery

According to our database, Nina Panickssery authored at least 6 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Understanding Jailbreak Success: A Study of Latent Space Dynamics in Large Language Models.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Beyond Data Filtering: Knowledge Localization for Capability Removal in LLMs.
CoRR, December, 2025

Mitigating Many-Shot Jailbreaking.
CoRR, April, 2025

Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Refusal in Language Models Is Mediated by a Single Direction.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024


