Rajashree Agrawal

Orcid: 0000-0001-7617-9180

According to our database1, Rajashree Agrawal authored at least 8 papers between 2024 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Towards a Scalable Proof Engine: A Performant Prototype Rewriting Primitive for Coq.
J. Autom. Reason., September, 2024

Modular addition without black-boxes: Compressing explanations of MLPs that compute numerical integration.
CoRR, 2024

Jailbreak Defense in a Narrow Domain: Limitations of Existing Methods and a New Transcript-Classifier Approach.
CoRR, 2024

When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
CoRR, 2024

Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data.
CoRR, 2024

Compact Proofs of Model Performance via Mechanistic Interpretability.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024



  Loading...