Rajashree Agrawal

Orcid: 0000-0001-7617-9180

According to our database¹, Rajashree Agrawal authored at least 8 papers between 2024 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Failures to Find Transferable Image Jailbreaks Between Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Towards a Scalable Proof Engine: A Performant Prototype Rewriting Primitive for Coq.

[BibT_eX]

[DOI]

J. Autom. Reason., September, 2024

Modular addition without black-boxes: Compressing explanations of MLPs that compute numerical integration.

[BibT_eX]

[DOI]

CoRR, 2024

Jailbreak Defense in a Narrow Domain: Limitations of Existing Methods and a New Transcript-Classifier Approach.

[BibT_eX]

[DOI]

CoRR, 2024

When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?

[BibT_eX]

[DOI]

CoRR, 2024

Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data.

[BibT_eX]

[DOI]

Matthias Gerstgrasser

CoRR, 2024

Compact Proofs of Model Performance via Mechanistic Interpretability.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Many-shot Jailbreaking.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Rajashree Agrawal

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...