Adi Simhi

According to our database1, Adi Simhi authored at least 9 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Old Habits Die Hard: How Conversational History Geometrically Traps LLMs.
CoRR, March, 2026

2025
BlackboxNLP-2025 MIB Shared Task: Improving Circuit Faithfulness via Better Edge Selection.
CoRR, October, 2025

HACK: Hallucinations Along Certainty and Knowledge Axes.
CoRR, October, 2025

ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs.
CoRR, October, 2025

Trust Me, I'm Wrong: High-Certainty Hallucinations in LLMs.
CoRR, February, 2025

Trust Me, I'm Wrong: LLMs Hallucinate with Certainty Despite Knowing the Answer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
Distinguishing Ignorance from Error in LLM Hallucinations.
CoRR, 2024

Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs.
CoRR, 2024

2023
Interpreting Embedding Spaces by Conceptualization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023


  Loading...