Johannes Treutlein

According to our database¹, Johannes Treutlein authored at least 12 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

School of Reward Hacks: Hacking harmless tasks generalizes to misaligned behavior in LLMs.

[BibT_eX]

[DOI]

CoRR, August, 2025

Auditing language models for hidden objectives.

[BibT_eX]

[DOI]

CoRR, March, 2025

2024

Alignment faking in large language models.

[BibT_eX]

[DOI]

CoRR, 2024

Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023

Conditioning Predictive Models: Risks and Strategies.

[BibT_eX]

[DOI]

CoRR, 2023

Incentivizing honest performative predictions with proper scoring rules.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2023

Similarity-based cooperative equilibrium.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

Similarity-based Cooperation.

[BibT_eX]

[DOI]

CoRR, 2022

Path Independent Equilibrium Models Can Better Exploit Test-Time Computation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

COLA: Consistent Learning with Opponent-Learning Awareness.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

2021

Normative Disagreement as a Challenge for Cooperative AI.

[BibT_eX]

[DOI]

CoRR, 2021

A New Formalism, Method and Open Issues for Zero-Shot Coordination.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Johannes Treutlein

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...