Sanjeevan Selvaganapathy
According to our database1,
Sanjeevan Selvaganapathy authored at least 4 papers
between 2025 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
Do LLMs Use Cultural Knowledge Without Being Told? A Multilingual Evaluation of Implicit Pragmatic Adaptation.
CoRR, April, 2026
Activation-Space Personality Steering: Hybrid Layer Selection for Stable Trait Control in LLMs.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026
Confident, Calibrated, or Complicit: Safety Alignment and Ideological Bias in LLM Hate Speech Detection.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026
2025
Confident, Calibrated, or Complicit: Probing the Trade-offs between Safety Alignment and Ideological Bias in Language Models in Detecting Hate Speech.
CoRR, September, 2025