Abir Harrasse

ORCID: 0009-0003-8534-7934

According to our database, Abir Harrasse authored at least 8 papers between 2024 and 2026.

Bibliography

2026
CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs.
CoRR, March, 2026

Curveball Steering: The Right Direction To Steer Isn't Always Linear.
CoRR, March, 2026

Debate, Deliberate, Decide (D3): A Cost-Aware Adversarial Framework for Reliable and Interpretable LLM Evaluation.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Tracing Multilingual Representations in LLMs with Cross-Layer Transcoders.
CoRR, November, 2025

TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research.
CoRR, March, 2025

Activation Space Interventions Can Be Transferred Between Large Language Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Adversarial Multi-Agent Evaluation of Large Language Models through Iterative Debates.
CoRR, 2024

