Leon Eshuijs

Orcid: 0009-0007-8393-7083

According to our database1, Leon Eshuijs authored at least 5 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Safety Training Modulates Harmful Misalignment Under On-Policy RL, But Direction Depends on Environment Design.
CoRR, April, 2026

2025
But what is your honest answer? Aiding LLM-judges with honest alternatives using steering vectors.
CoRR, May, 2025

Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification.
CoRR, May, 2025

Automatic Evaluation Metrics for Artificially Generated Scientific Research.
CoRR, March, 2025

2024
Balancing the Scales: Reinforcement Learning for Fair Classification.
CoRR, 2024


  Loading...