Trenton Bricken

According to our database, Trenton Bricken authored at least 7 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Cross-Architecture Model Diffing with Crosscoders: Unsupervised Discovery of Differences Between LLMs.
CoRR, February, 2026

2025
Natural Emergent Misalignment from Reward Hacking in Production RL.
CoRR, November, 2025

Auditing language models for hidden objectives.
CoRR, March, 2025

2024

2023
Emergence of Sparse Representations from Noise.
Proceedings of the International Conference on Machine Learning, 2023

Sparse Distributed Memory is a Continual Learner.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2021
Attention Approximates Sparse Distributed Memory.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

