Shaona Ghosh

Orcid: 0000-0003-4658-5174

According to our database, Shaona Ghosh authored at least 25 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks.
CoRR, March, 2026

Beyond Explainable AI (XAI): An Overdue Paradigm Shift and Post-XAI Research Directions.
CoRR, February, 2026

2025
A Safety and Security Framework for Real-World Agentic Systems.
CoRR, November, 2025

SafeSteer: Interpretable Safety Steering with Refusal-Evasion in LLMs.
CoRR, June, 2025

Surfacing Semantic Orthogonality Across Model Safety Benchmarks: A Multi-Dimensional Analysis.
CoRR, May, 2025

AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons.
CoRR, March, 2025

AEGIS2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

GeoSAFE - A Novel Geospatial Artificial Intelligence Safety Assurance Framework and Evaluation for LLM Moderation.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

CultureGuard: Towards Culturally-Aware Dataset and Guard Model for Multilingual Safety Applications.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

A Simple Yet Effective Method for Non-Refusing Context Relevant Fine-grained Safety Steering in LLMs.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Guardrails and Security for LLMs: Safe, Secure and Controllable Steering of LLM Applications.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts), 2025

2024
Towards Inference-time Category-wise Safety Steering for Large Language Models.
CoRR, 2024

AEGIS: Online Adaptive AI Content Safety Moderation with Ensemble of LLM Experts.
CoRR, 2024

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues.
CoRR, 2024

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2021
Joint Learning of Portrait Intrinsic Decomposition and Relighting.
CoRR, 2021

2017
Neural Networks for Text Correction and Completion in Keyboard Decoding.
CoRR, 2017

2016
Network Lasso Optimization For Smart City Ride Share Prediction.
CoRR, 2016

Online machine learning for networked data.
AI Matters, 2016

Extended Formulations for Online Action Selection on Big Action Sets.
Proceedings of the Advances in Big Data, 2016

2015
Ising Bandits with Side Information.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Online Prediction at the Limit of Zero Temperature.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Online Mean Field Approximation for Automated Experimentation.
Proceedings of the 4th Workshop on Machine Learning for Interactive Systems, 2015

2013
Extended Formulations for Online Linear Bandit Optimization.
CoRR, 2013

Towards Pareto Descent Directions in Sampling Experts for Multiple Tasks in an On-Line Learning Paradigm.
Proceedings of the Lifelong Machine Learning, 2013

