Laurène Vaugrante

According to our database1, Laurène Vaugrante authored at least 4 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Emergently Misaligned Language Models Show Behavioral Self-Awareness That Shifts With Subsequent Realignment.
CoRR, February, 2026

2025
Compromising Honesty and Harmlessness in Language Models via Deception Attacks.
CoRR, February, 2025

Prompt Engineering Techniques for Language Model Reasoning Lack Replicability.
Trans. Mach. Learn. Res., 2025

2024
A Looming Replication Crisis in Evaluating Behavior in Language Models? Evidence and Solutions.
CoRR, 2024


  Loading...