Kevin Ro Wang

Affiliations:
  • Redwood Research, USA


According to our database, Kevin Ro Wang authored at least 2 papers between 2022 and 2023.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2023
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 Small.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

