Daniel M. Ziegler

Affiliations:
  • Redwood Research, Berkeley, CA, USA
  • OpenAI, San Francisco, CA, USA (former)


According to our database, Daniel M. Ziegler authored at least 8 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training.
CoRR, 2024

2022
Adversarial training for high-stakes reliability.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Recursively Summarizing Books with Human Feedback.
CoRR, 2021

2020
Scaling Laws for Autoregressive Generative Modeling.
CoRR, 2020

Learning to summarize from human feedback.
CoRR, 2020

Learning to summarize with human feedback.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020


2019
Fine-Tuning Language Models from Human Preferences.
CoRR, 2019

