Samuel Simko

According to our database¹, Samuel Simko authored at least 3 papers between 2025 and 2026.

Collaborative distances:

Timeline

Book In proceedings Article PhD thesis Dataset Other

2026

TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering.

[DOI]

CoRR, February, 2026

2025

Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability.

[DOI]

CoRR, May, 2025

Improving Large Language Model Safety with Contrastive Representation Learning.

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025