Ansh Radhakrishnan

According to our database1, Ansh Radhakrishnan authored at least 7 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
Reasoning Models Don't Always Say What They Think.
CoRR, May, 2025

Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training.
CoRR, 2024


Debating with More Persuasive LLMs Leads to More Truthful Answers.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Measuring Faithfulness in Chain-of-Thought Reasoning.
CoRR, 2023

Question Decomposition Improves the Faithfulness of Model-Generated Reasoning.
CoRR, 2023


  Loading...