Samuel Simko

According to our database, Samuel Simko authored at least 3 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering.
CoRR, February, 2026

2025
Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability.
CoRR, May, 2025

Improving Large Language Model Safety with Contrastive Representation Learning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

