Joe Needham

According to our database1, Joe Needham authored at least 2 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Large Language Models Often Know When They Are Being Evaluated.
CoRR, May, 2025

2024
Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack.
CoRR, 2024


  Loading...