Alice Rigg

According to our database1, Alice Rigg authored at least 6 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Distribution-Aware Feature Selection for SAEs.
CoRR, August, 2025

Detecting and Characterizing Planning in Language Models.
CoRR, August, 2025

Converting MLPs into Polynomials in Closed Form.
CoRR, February, 2025

Bilinear MLPs enable weight-based mechanistic interpretability.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Bilinear Convolution Decomposition for Causal RL Interpretability.
CoRR, 2024

Weight-based Decomposition: A Case for Bilinear MLPs.
CoRR, 2024


  Loading...