Patrick Leask

According to our database1, Patrick Leask authored at least 4 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models.
CoRR, May, 2025

Sparse Autoencoders Do Not Find Canonical Units of Analysis.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
BatchTopK Sparse Autoencoders.
CoRR, 2024

2023
CoinRun: Solving Goal Misgeneralisation.
CoRR, 2023


  Loading...