Abhinav Rao

According to our database1, Abhinav Rao authored at least 7 papers between 2022 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
[WIP] Jailbreak Paradox: The Achilles' Heel of LLMs.
CoRR, 2024

NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models.
CoRR, 2024

Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
MALITE: Lightweight Malware Detection and Classification for Constrained Devices.
CoRR, 2023

Tricking LLMs into Disobedience: Understanding, Analyzing, and Preventing Jailbreaks.
CoRR, 2023

Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin.
CoRR, 2022


  Loading...