Rom Himelstein

According to our database1, Rom Himelstein authored at least 7 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Step-Wise Refusal Dynamics in Autoregressive and Diffusion Language Models.
CoRR, February, 2026

Silenced Biases: The Dark Side LLMs Learned to Refuse.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
You Had One Job: Per-Task Quantization Using LLMs' Hidden Representations.
CoRR, November, 2025

Silent Tokens, Loud Effects: Padding in LLMs.
CoRR, October, 2025

Leveraging NTPs for Efficient Hallucination Detection in VLMs.
CoRR, September, 2025

Enhancing Jailbreak Attacks via Compliance-Refusal-Based Initialization.
CoRR, February, 2025

Jailbreak Attack Initializations as Extractors of Compliance Directions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025


  Loading...