Fabien Roger

According to our database, Fabien Roger authored at least 5 papers between 2022 and 2023.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2023
AI Control: Improving Safety Despite Intentional Subversion.
CoRR, 2023

Preventing Language Models From Hiding Their Reasoning.
CoRR, 2023

Measurement Tampering Detection Benchmark.
CoRR, 2023

Large Language Models Sometimes Generate Purely Negatively-Reinforced Text.
CoRR, 2023

2022
Language models are better than humans at next-token prediction.
CoRR, 2022

