Emil Ryd

According to our database1, Emil Ryd authored at least 5 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Removing Sandbagging in LLMs by Training with Weak Supervision.
CoRR, April, 2026

2025
Inoculation Prompting: Instructing LLMs to misbehave at train-time improves test-time alignment.
CoRR, October, 2025

Eliciting Secret Knowledge from Language Models.
CoRR, October, 2025

Towards eliciting latent knowledge from LLMs with mechanistic interpretability.
CoRR, May, 2025

Fine Flood Forecasts: Incorporating local data into global models through fine-tuning.
CoRR, April, 2025


  Loading...