Newton Cheng

According to our database1, Newton Cheng authored at least 7 papers between 2020 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training.
CoRR, 2024

2023
Specific versus General Principles for Constitutional AI.
CoRR, 2023

Towards Understanding Sycophancy in Language Models.
CoRR, 2023

Measuring Faithfulness in Chain-of-Thought Reasoning.
CoRR, 2023

Question Decomposition Improves the Faithfulness of Model-Generated Reasoning.
CoRR, 2023

2022
Topological Link Models of Multipartite Entanglement.
Quantum, 2022

2020
The Quantum Entropy Cone of Hypergraphs.
CoRR, 2020


  Loading...