Nora Petrova

According to our database1, Nora Petrova authored at least 6 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
The Missing Red Line: How Commercial Pressure Erodes AI Safety Boundaries.
CoRR, March, 2026

Unpacking Human Preference for LLMs: Demographically Aware Evaluation with the HUMAINE Framework.
CoRR, March, 2026

Pressure Reveals Character: Behavioural Alignment Evaluation at Depth.
CoRR, February, 2026

2025
Latent Adversarial Training Improves the Representation of Refusal.
CoRR, April, 2025

2024
Characterizing stable regions in the residual stream of LLMs.
CoRR, 2024

Evaluating Synthetic Activations composed of SAE Latents in GPT-2.
CoRR, 2024


  Loading...