Maximilian Li

According to our database1, Maximilian Li authored at least 3 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Endless Jailbreaks with Bijection Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Optimal ablation for interpretability.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Circuit Breaking: Removing Model Behaviors with Targeted Ablation.
CoRR, 2023


  Loading...