Aradhana Sinha

Orcid: 0009-0006-0092-8214

According to our database1, Aradhana Sinha authored at least 6 papers between 2023 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
The Singapore Consensus on Global AI Safety Research Priorities.
CoRR, June, 2025

2024
Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks.
Trans. Mach. Learn. Res., 2024

InfAlign: Inference-aware language model alignment.
CoRR, 2024

Automated Adversarial Discovery for Safety Classifiers.
CoRR, 2024

Generalized People Diversity: Learning a Human Perception-Aligned Diversity Representation for People Images.
Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024

2023
Improving Few-shot Generalization of Safety Classifiers via Data Augmented Parameter-Efficient Fine-Tuning.
CoRR, 2023


  Loading...