Narmeen Oozeer

According to our database1, Narmeen Oozeer authored at least 15 papers between 2024 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Riemannian-Manifold Steering: Geometry-Aware Generative Autoencoders for Label-Free Steering.
CoRR, May, 2026

Geometry-Aware CLIP Retrieval via Local Cross-Modal Alignment and Steering.
CoRR, April, 2026

DreamReader: An Interpretability Toolkit for Text-to-Image Models.
CoRR, March, 2026

Understanding and Mitigating Dataset Corruption in LLM Steering.
CoRR, March, 2026

Spectral Superposition: A Theory of Feature Geometry.
CoRR, February, 2026

Are LLM Evaluators Really Narcissists? Sanity Checking Self-Preference Evaluations.
CoRR, January, 2026

Make Mechanistic Interpretability Auditable: A Call to Develop Guidelines via Continuous Collaborative Reviewing.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
Approximating Human Preferences Using a Multi-Judge Learned System.
CoRR, October, 2025

Position: Require Frontier AI Labs To Release Small "Analog" Models.
CoRR, October, 2025

Breaking the Mirror: Activation-Based Mitigation of Self-Preference in LLM Evaluators.
CoRR, September, 2025

Distribution-Aware Feature Selection for SAEs.
CoRR, August, 2025

Beyond Monoliths: Expert Orchestration for More Capable, Democratic, and Safe Large Language Models.
CoRR, June, 2025

Activation Space Interventions Can Be Transferred Between Large Language Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Beyond Linear Steering: Unified Multi-Attribute Control for Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
Bilinear Convolution Decomposition for Causal RL Interpretability.
CoRR, 2024


  Loading...