János Kramár

According to our database1, János Kramár authored at least 21 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
AtP*: An efficient and scalable method for localizing LLM behaviour to components.
CoRR, 2024

2023
Explaining grokking through circuit efficiency.
CoRR, 2023

The Hydra Effect: Emergent Self-repair in Language Model Computations.
CoRR, 2023

Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla.
CoRR, 2023

Power-seeking can be probable and predictive for trained agents.
CoRR, 2023

Tracr: Compiled Transformers as a Laboratory for Interpretability.
CoRR, 2023

Tracr: Compiled Transformers as a Laboratory for Interpretability.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Sample-based Approximation of Nash in Large Many-Player Games via Gradient Descent.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

2021
A Neural Network Auction For Group Decision Making Over a Continuous Space.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020
Learning to Play No-Press Diplomacy with Best Response Policy Iteration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Should I Tear down This Wall? Optimizing Social Metrics by Evaluating Novel Actions.
Proceedings of the Coordination, Organizations, Institutions, Norms, and Ethics for Governance of Multi-Agent Systems XIII, 2020

2019
OpenSpiel: A Framework for Reinforcement Learning in Games.
CoRR, 2019

Learning Reciprocity in Complex Sequential Social Dilemmas.
CoRR, 2019

Relational Forward Models for Multi-Agent Learning.
Proceedings of the 7th International Conference on Learning Representations, 2019

The Imitation Game: Learned Reciprocity in Markov games.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills.
Proceedings of the Robotics: Science and Systems XIV, 2018

2017
Guidelines for Artificial Intelligence Containment.
CoRR, 2017

Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations.
CoRR, 2016

The AGI Containment Problem.
Proceedings of the Artificial General Intelligence - 9th International Conference, 2016

2010
A Generalized-Zero-Preserving Method for Compact Encoding of Concept Lattices.
Proceedings of the ACL 2010, 2010


  Loading...