David Krueger

Orcid: 0000-0001-7256-0937

According to our database1, David Krueger authored at least 45 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Safety Cases: How to Justify the Safety of Advanced AI Systems.
CoRR, 2024

A Generative Model of Symmetry Transformations.
CoRR, 2024

Black-Box Access is Insufficient for Rigorous AI Audits.
CoRR, 2024

Visibility into AI Agents.
CoRR, 2024

2023
Hazards from Increasingly Accessible Fine-Tuning of Downloadable Foundation Models.
CoRR, 2023

Managing AI Risks in an Era of Rapid Progress.
CoRR, 2023

Meta- (out-of-context) learning in neural networks.
CoRR, 2023

Reward Model Ensembles Help Mitigate Overoptimization.
CoRR, 2023

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback.
CoRR, 2023

Investigating the Nature of 3D Generalization in Deep Neural Networks.
CoRR, 2023

Unifying Grokking and Double Descent.
CoRR, 2023

Blockwise Self-Supervised Learning at Scale.
CoRR, 2023

On The Fragility of Learned Reward Functions.
CoRR, 2023

Thinker: Learning to Plan and Act.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Broken Neural Scaling Laws.
Proceedings of the Eleventh International Conference on Learning Representations, 2023


Characterizing Manipulation from AI Systems.
Proceedings of the 3rd ACM Conference on Equity and Access in Algorithms, 2023

2022
Domain Generalization for Robust Model-Based Offline Reinforcement Learning.
CoRR, 2022

Towards Out-of-Distribution Adversarial Robustness.
CoRR, 2022

Defining and Characterizing Reward Hacking.
CoRR, 2022

Defining and Characterizing Reward Gaming.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Goal Misgeneralization in Deep Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

2021
Multi-Domain Balanced Sampling Improves Out-of-Distribution Generalization of Chest X-ray Pathology Prediction Models.
CoRR, 2021

Filling gaps in trustworthy development of AI.
CoRR, 2021

Out-of-Distribution Generalization via Risk Extrapolation (REx).
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Active Reinforcement Learning: Observing Rewards at a Cost.
CoRR, 2020

Hidden Incentives for Auto-Induced Distributional Shift.
CoRR, 2020

AI Research Considerations for Human Existential Safety (ARCHES).
CoRR, 2020

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims.
CoRR, 2020

Out-of-Distribution Generalization via Risk Extrapolation (REx).
CoRR, 2020

2018
Scalable agent alignment via reward modeling: a research direction.
CoRR, 2018

Uncertainty in Multitask Transfer Learning.
CoRR, 2018

Neural Autoregressive Flows.
Proceedings of the 35th International Conference on Machine Learning, 2018

2017
Deep Prior.
CoRR, 2017

Bayesian Hypernetworks.
CoRR, 2017

A Closer Look at Memorization in Deep Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017

Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations.
Proceedings of the 5th International Conference on Learning Representations, 2017

Deep Nets Don't Learn via Memorization.
Proceedings of the 5th International Conference on Learning Representations, 2017

Nested LSTMs.
Proceedings of The 9th Asian Conference on Machine Learning, 2017

2016
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations.
CoRR, 2016

Regularizing RNNs by Stabilizing Activations.
Proceedings of the 4th International Conference on Learning Representations, 2016

2015
Zero-bias autoencoders and the benefits of co-adapting features.
Proceedings of the 3rd International Conference on Learning Representations, 2015

NICE: Non-linear Independent Components Estimation.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Testing Visual Attention in Dynamic Environments.
CoRR, 2015


  Loading...