Dillon Bowen

Orcid: 0000-0002-3033-1332

According to our database, Dillon Bowen authored at least 8 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Jailbreak-Tuning: Models Efficiently Learn Jailbreak Susceptibility.
CoRR, July, 2025

The Safety Gap Toolkit: Evaluating Hidden Dangers of Open-Source Models.
CoRR, July, 2025

AI Companies Should Report Pre- and Post-Mitigation Safety Evaluations.
CoRR, March, 2025

Scaling Trends for Data Poisoning in LLMs.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Scaling Laws for Data Poisoning in LLMs.
CoRR, 2024

A StrongREJECT for Empty Jailbreaks.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2022
Multiple Inference: A Python package for comparing multiple parameters.
J. Open Source Softw., 2022

2020
Generalized SHAP: Generating multiple types of explanations in machine learning.
CoRR, 2020

