James Beetham

According to our database1, James Beetham authored at least 5 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Jailbreaks as Inference-Time Alignment: A Framework for Understanding Safety Failures in LLMs.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

SafeR-CLIP: Mitigating NSFW Content in Vision-Language Models While Preserving Pre-Trained Knowledge.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2024
LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds.
CoRR, 2024

2023
Dual Student Networks for Data-Free Model Stealing.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Detecting Compromised Architecture/Weights of a Deep Model.
Proceedings of the 26th International Conference on Pattern Recognition, 2022


  Loading...