Jack Clark

Orcid: 0000-0003-3886-7657

According to our database, Jack Clark authored at least 32 papers between 2004 and 2024.

Bibliography

2024
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training.
CoRR, 2024

Validating Database System Isolation Level Implementations with Version Certificate Recovery.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

2023
Taking Back Control in an Intermediate Representation for GPU Computing.
Proc. ACM Program. Lang., January, 2023

Artificial Intelligence Index Report 2023.
CoRR, 2023

Towards Measuring the Representation of Subjective Global Opinions in Language Models.
CoRR, 2023

Model evaluation for extreme risks.
CoRR, 2023

Regulatory Markets: The Future of AI Governance.
CoRR, 2023

The Capacity for Moral Self-Correction in Large Language Models.
CoRR, 2023

Visual Information Processing in Virtual Reality: Merging Theory and Practice.
Proceedings of the Information for a Better World: Normality, Virtuality, Physicality, Inclusivity, 2023


2022
Discovering Language Model Behaviors with Model-Written Evaluations.
CoRR, 2022

In-context Learning and Induction Heads.
CoRR, 2022

Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned.
CoRR, 2022

Language Models (Mostly) Know What They Know.
CoRR, 2022

The AI Index 2022 Annual Report.
CoRR, 2022

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.
CoRR, 2022

Predictability and Surprise in Large Generative Models.
CoRR, 2022


2021
A General Language Assistant as a Laboratory for Alignment.
CoRR, 2021

Why and How Governments Should Monitor AI Development.
CoRR, 2021

Evaluating CLIP: Towards Characterization of Broader Capabilities and Downstream Implications.
CoRR, 2021

The AI Index 2021 Annual Report.
CoRR, 2021

Understanding the Capabilities, Limitations, and Societal Impact of Large Language Models.
CoRR, 2021

Learning Transferable Visual Models From Natural Language Supervision.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Measurement in AI Policy: Opportunities and Challenges.
CoRR, 2020

Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims.
CoRR, 2020

Regulatory Markets for AI Safety.
CoRR, 2020


2019
Release Strategies and the Social Impacts of Language Models.
CoRR, 2019

2018
The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation.
CoRR, 2018

2006
A Sequence Given in Terms of Harmonic Numbers: 11115.
Am. Math. Mon., 2006

2004
Problem 11115.
Am. Math. Mon., 2004

