Simon Lermen

Orcid: 0009-0007-8614-0395

According to our database1, Simon Lermen authored at least 10 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Large-scale online deanonymization with LLMs.
CoRR, February, 2026

Evaluating large language models' ability to automate spear phishing.
Expert Syst. Appl., 2026

2025
Can AI Models be Jailbroken to Phish Elderly Victims? An End-to-End Evaluation.
CoRR, November, 2025

Deceptive Automated Interpretability: Language Models Coordinating to Fool Oversight Systems.
CoRR, April, 2025

2024
Evaluating Large Language Models' Capability to Launch Fully Automated Spear Phishing Campaigns: Validated on Human Subjects.
CoRR, 2024

Applying Refusal-Vector Ablation to Llama 3.1 70B Agents.
CoRR, 2024

2023
Exploring the Robustness of Model-Graded Evaluations and Automated Interpretability.
CoRR, 2023

BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B.
CoRR, 2023

LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B.
CoRR, 2023

Evaluating Shutdown Avoidance of Language Models in Textual Scenarios.
CoRR, 2023


  Loading...