David Krueger

ORCID: 0000-0001-7256-0937

According to our database, David Krueger authored at least 45 papers between 2015 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

Safety Cases: How to Justify the Safety of Advanced AI Systems.

CoRR, 2024

A Generative Model of Symmetry Transformations.

James Urquhart Allingham

Bruno Kacper Mlodozeniec

José Miguel Hernández-Lobato

CoRR, 2024

Black-Box Access is Insufficient for Rigorous AI Audits.

CoRR, 2024

Visibility into AI Agents.

CoRR, 2024

2023

Hazards from Increasingly Accessible Fine-Tuning of Downloadable Foundation Models.

CoRR, 2023

Managing AI Risks in an Era of Rapid Progress.

CoRR, 2023

Meta- (out-of-context) learning in neural networks.

Dmitrii Krasheninnikov

Egor Krasheninnikov

Bruno Mlodozeniec

David Krueger

CoRR, 2023

Reward Model Ensembles Help Mitigate Overoptimization.

CoRR, 2023

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback.

CoRR, 2023

Investigating the Nature of 3D Generalization in Deep Neural Networks.

Shoaib Ahmed Siddiqui

David Krueger

Thomas M. Breuel

CoRR, 2023

Unifying Grokking and Double Descent.

Xander Davies

Lauro Langosco

David Krueger

CoRR, 2023

Blockwise Self-Supervised Learning at Scale.

Shoaib Ahmed Siddiqui

David Krueger

Yann LeCun

Stéphane Deny

CoRR, 2023

On The Fragility of Learned Reward Functions.

CoRR, 2023

Thinker: Learning to Plan and Act.

Stephen Chung

Ivan Anokhin

David Krueger

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics.

Shoaib Ahmed Siddiqui

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Broken Neural Scaling Laws.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Harms from Increasingly Agentic Algorithmic Systems.

Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023

Characterizing Manipulation from AI Systems.

Proceedings of the 3rd ACM Conference on Equity and Access in Algorithms, 2023

2022

Domain Generalization for Robust Model-Based Offline Reinforcement Learning.

Alan Clark

Shoaib Ahmed Siddiqui

CoRR, 2022

Towards Out-of-Distribution Adversarial Robustness.

Adam Ibrahim

Charles Guille-Escuret

CoRR, 2022

Defining and Characterizing Reward Hacking.

Joar Skalse

Nikolaus H. R. Howe

Dmitrii Krasheninnikov

David Krueger

CoRR, 2022

Defining and Characterizing Reward Gaming.

Joar Skalse

Nikolaus H. R. Howe

Dmitrii Krasheninnikov

David Krueger

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Goal Misgeneralization in Deep Reinforcement Learning.

Lauro Langosco di Langosco

Proceedings of the International Conference on Machine Learning, 2022

2021

Multi-Domain Balanced Sampling Improves Out-of-Distribution Generalization of Chest X-ray Pathology Prediction Models.

CoRR, 2021

Filling gaps in trustworthy development of AI.

CoRR, 2021

Out-of-Distribution Generalization via Risk Extrapolation (REx).

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Active Reinforcement Learning: Observing Rewards at a Cost.

CoRR, 2020

Hidden Incentives for Auto-Induced Distributional Shift.

David Krueger

Tegan Maharaj

Jan Leike

CoRR, 2020

AI Research Considerations for Human Existential Safety (ARCHES).

Andrew Critch

David Krueger

CoRR, 2020

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims.

Thomas Krendl Gilbert

CoRR, 2020

Out-of-Distribution Generalization via Risk Extrapolation (REx).

CoRR, 2020

2018

Scalable agent alignment via reward modeling: a research direction.

CoRR, 2018

Uncertainty in Multitask Transfer Learning.

CoRR, 2018

Neural Autoregressive Flows.

Proceedings of the 35th International Conference on Machine Learning, 2018

2017

Deep Prior.

CoRR, 2017

Bayesian Hypernetworks.

CoRR, 2017

A Closer Look at Memorization in Deep Networks.

Devansh Arpit

Stanislaw Jastrzebski

Proceedings of the 34th International Conference on Machine Learning, 2017

Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations.

Proceedings of the 5th International Conference on Learning Representations, 2017

Deep Nets Don't Learn via Memorization.

David Krueger

Nicolas Ballas

Stanislaw Jastrzebski

Proceedings of the 5th International Conference on Learning Representations, 2017

Nested LSTMs.

Joel Ruben Antony Moniz

David Krueger

Proceedings of The 9th Asian Conference on Machine Learning, 2017

2016

Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations.

CoRR, 2016

Regularizing RNNs by Stabilizing Activations.

David Krueger

Roland Memisevic

Proceedings of the 4th International Conference on Learning Representations, 2016

2015

Zero-bias autoencoders and the benefits of co-adapting features.

Roland Memisevic

Kishore Reddy Konda

David Krueger

Proceedings of the 3rd International Conference on Learning Representations, 2015

NICE: Non-linear Independent Components Estimation.

Laurent Dinh

David Krueger

Yoshua Bengio

Proceedings of the 3rd International Conference on Learning Representations, 2015

Testing Visual Attention in Dynamic Environments.

Philip Bachman

David Krueger

Doina Precup

CoRR, 2015
