Catherine Olsson

According to our database1, Catherine Olsson authored at least 23 papers between 2011 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Normalization by orientation-tuned surround in human V1-V3.
PLoS Comput. Biol., December, 2023

Specific versus General Principles for Constitutional AI.
CoRR, 2023

The Capacity for Moral Self-Correction in Large Language Models.
CoRR, 2023


2022
Discovering Language Model Behaviors with Model-Written Evaluations.
CoRR, 2022

Constitutional AI: Harmlessness from AI Feedback.
CoRR, 2022

In-context Learning and Induction Heads.
CoRR, 2022

Toy Models of Superposition.
CoRR, 2022

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned.
CoRR, 2022

Language Models (Mostly) Know What They Know.
CoRR, 2022

Scaling Laws and Interpretability of Learning from Repeated Data.
CoRR, 2022

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.
CoRR, 2022

Predictability and Surprise in Large Generative Models.
CoRR, 2022


2021
A General Language Assistant as a Laboratory for Alignment.
CoRR, 2021

2019
Dota 2 with Large Scale Deep Reinforcement Learning.
CoRR, 2019

TensorFuzz: Debugging Neural Networks with Coverage-Guided Fuzzing.
Proceedings of the 36th International Conference on Machine Learning, 2019

Discriminator Rejection Sampling.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Unrestricted Adversarial Examples.
CoRR, 2018

Skill Rating for Generative Models.
CoRR, 2018

Is Generator Conditioning Causally Related to GAN Performance?
Proceedings of the 35th International Conference on Machine Learning, 2018

2014
Predicting Actions from Static Scenes.
Proceedings of the Computer Vision - ECCV 2014, 2014

2011
Finding and Explaining Similarities in Linked Data.
Proceedings of the Sixth International Conference on Semantic Technologies for Intelligence, 2011


  Loading...