James Beetham

According to our database, James Beetham authored at least 5 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Jailbreaks as Inference-Time Alignment: A Framework for Understanding Safety Failures in LLMs.

Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

SafeR-CLIP: Mitigating NSFW Content in Vision-Language Models While Preserving Pre-Trained Knowledge.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2024

LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds.

CoRR, 2024

2023

Dual Student Networks for Data-Free Model Stealing.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Detecting Compromised Architecture/Weights of a Deep Model.

Proceedings of the 26th International Conference on Pattern Recognition, 2022
