Dan Mossing

According to our database, Dan Mossing authored at least 3 papers in 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Weight-sparse transformers have interpretable circuits.
CoRR, November, 2025

Persona Features Control Emergent Misalignment.
CoRR, June, 2025

Investigating task-specific prompts and sparse autoencoders for activation monitoring.
CoRR, April, 2025
