Deep Ganguli

According to our database1, Deep Ganguli authored at least 22 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training.
CoRR, 2024

2023
Evaluating and Mitigating Discrimination in Language Model Decisions.
CoRR, 2023

Report of the 1st Workshop on Generative AI and Law.
CoRR, 2023

Towards Measuring the Representation of Subjective Global Opinions in Language Models.
CoRR, 2023

Opportunities and Risks of LLMs for Scalable Deliberation with Polis.
CoRR, 2023

The Capacity for Moral Self-Correction in Large Language Models.
CoRR, 2023


2022
Discovering Language Model Behaviors with Model-Written Evaluations.
CoRR, 2022

Constitutional AI: Harmlessness from AI Feedback.
CoRR, 2022

In-context Learning and Induction Heads.
CoRR, 2022

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned.
CoRR, 2022

Language Models (Mostly) Know What They Know.
CoRR, 2022

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.
CoRR, 2022

Predictability and Surprise in Large Generative Models.
CoRR, 2022


2021
starfish: scalable pipelines for image-based transcriptomics.
J. Open Source Softw., 2021

A General Language Assistant as a Laboratory for Alignment.
CoRR, 2021

The AI Index 2021 Annual Report.
CoRR, 2021

Understanding the Capabilities, Limitations, and Societal Impact of Large Language Models.
CoRR, 2021

2014
Efficient Sensory Encoding and Bayesian Inference with Heterogeneous Neural Populations.
Neural Comput., 2014

Druid: a real-time analytical data store.
Proceedings of the International Conference on Management of Data, 2014

2010
Implicit encoding of prior probabilities in optimal neural populations.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010


  Loading...