Gonçalo Paulo

According to our database, Gonçalo Paulo authored at least 8 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Evaluating SAE interpretability without explanations.
CoRR, July, 2025

When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research.
CoRR, May, 2025

Partially Rewriting a Transformer in Natural Language.
CoRR, January, 2025

Transcoders Beat Sparse Autoencoders for Interpretability.
CoRR, January, 2025

Sparse Autoencoders Trained on the Same Data Learn Different Features.
CoRR, January, 2025

Do Transformer Interpretability Methods Transfer to RNNs?
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Automatically Interpreting Millions of Features in Large Language Models.
CoRR, 2024

Does Transformer Interpretability Transfer to RNNs?
CoRR, 2024

