Anna Soligo

Orcid: 0009-0009-1444-890X

According to our database1, Anna Soligo authored at least 9 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Gemma Needs Help: Investigating and Mitigating Emotional Instability in LLMs.
CoRR, March, 2026

Emergent Misalignment is Easy, Narrow Misalignment is Hard.
CoRR, February, 2026

2025
Emergent misalignment as prompt sensitivity: A research note.
CoRR, July, 2025

An LLM's Apology: Outsourcing Awkwardness in the Age of AI.
CoRR, June, 2025

Convergent Linear Representations of Emergent Misalignment.
CoRR, June, 2025

Model Organisms for Emergent Misalignment.
CoRR, June, 2025

Induced Modularity and Community Detection for Functionally Interpretable Reinforcement Learning.
CoRR, January, 2025

Explainable Reinforcement and Causal Learning for Improving Trust to 6G Stakeholders.
IEEE Open J. Commun. Soc., 2025

Inducing, Detecting and Characterising Neural Modules: A Pipeline for Functional Interpretability in Reinforcement Learning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025


  Loading...