Euan Ong

According to our database1, Euan Ong authored at least 6 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Auditing language models for hidden objectives.
CoRR, March, 2025

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming.
CoRR, January, 2025

2024
Compact Proofs of Model Performance via Mechanistic Interpretability.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Image Hijacks: Adversarial Images can Control Generative Models at Runtime.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Successor Heads: Recurring, Interpretable Attention Heads In The Wild.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2022
Learnable Commutative Monoids for Graph Neural Networks.
Proceedings of the Learning on Graphs Conference, 2022


  Loading...