Nicholas Joseph
According to our database1,
Nicholas Joseph
authored at least 21 papers
between 2021 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
CoRR, September, 2025
2023
CoRR, 2023
Towards Measuring the Representation of Subjective Global Opinions in Language Models.
CoRR, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned.
CoRR, 2022
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.
CoRR, 2022
Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022
2021