Himabindu Lakkaraju

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

The First Workshop on AI Behavioral Science.

[BibT_eX]

[DOI]

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

In-Context Unlearning: Language Models as Few-Shot Unlearners.

[BibT_eX]

[DOI]

Seth Neel

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Understanding the Effects of Iterative Prompting on Truthfulness.

[BibT_eX]

[DOI]

Satyapriya Krishna

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Quantifying Uncertainty in Natural Language Explanations of Large Language Models.

[BibT_eX]

[DOI]

Sree Harsha Tanneru

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Fair Machine Unlearning: Data Removal while Mitigating Disparities.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023

Explaining machine learning models with interactive natural language conversations using TalkToModel.

[BibT_eX]

[DOI]

Nat. Mac. Intell., August, 2023

When Does Uncertainty Matter?: Understanding the Impact of Predictive Uncertainty in ML Assisted Decision Making.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Is Ignorance Bliss? The Role of Post Hoc Explanation Faithfulness and Alignment in Model Trust in Laypeople and Domain Experts.

[BibT_eX]

[DOI]

CoRR, 2023

Investigating the Fairness of Large Language Models for Predictions on Tabular Data.

[BibT_eX]

[DOI]

CoRR, 2023

Are Large Language Models Post Hoc Explainers?

[BibT_eX]

[DOI]

CoRR, 2023

On the Trade-offs between Adversarial Robustness and Actionable Explanations.

[BibT_eX]

[DOI]

Satyapriya Krishna

CoRR, 2023

Certifying LLM Safety against Adversarial Prompting.

[BibT_eX]

[DOI]

CoRR, 2023

Accurate, Explainable, and Private Models: Providing Recourse While Minimizing Training Data Leakage.

[BibT_eX]

[DOI]

CoRR, 2023

Verifiable Feature Attributions: A Bridge between Post Hoc Explainability and Inherent Interpretability.

[BibT_eX]

[DOI]

Usha Bhalla

CoRR, 2023

Efficient Estimation of the Local Robustness of Machine Learning Models.

[BibT_eX]

[DOI]

Tessa Han

CoRR, 2023

Analyzing Chain-of-Thought Prompting in Large Language Models via Gradient-based Feature Attributions.

[BibT_eX]

[DOI]

CoRR, 2023

Consistent Explanations in the Face of Model Indeterminacy via Ensembling.

[BibT_eX]

[DOI]

CoRR, 2023

Word-Level Explanations for Analyzing Bias in Text-to-Image Models.

[BibT_eX]

[DOI]

CoRR, 2023

Tutorials at The Web Conference 2023.

[BibT_eX]

[DOI]

Behrooz Omidvar-Tehrani

Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

On Minimizing the Impact of Dataset Shifts on Actionable Explanations.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2023

Which Models have Perceptually-Aligned Gradients? An Explanation via Off-Manifold Robustness.

[BibT_eX]

[DOI]

Sebastian Bordt

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Post Hoc Explanations of Language Models Can Improve Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Discriminative Feature Attributions: Bridging Post Hoc Explainability and Inherent Interpretability.

[BibT_eX]

[DOI]

Usha Bhalla

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

M<sup>4</sup>: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities and Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Generative AI meets Responsible AI: Practical Challenges and Opportunities.

[BibT_eX]

[DOI]

Nazneen Rajani

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Towards Bridging the Gaps between the Right to Explanation and the Right to be Forgotten.

[BibT_eX]

[DOI]

Satyapriya Krishna

Jiaqi Ma

Proceedings of the International Conference on Machine Learning, 2023

On the Impact of Algorithmic Recourse on Social Segregation.

[BibT_eX]

[DOI]

Ruijiang Gao

Proceedings of the International Conference on Machine Learning, 2023

Probabilistically Robust Recourse: Navigating the Trade-offs between Costs and Robustness in Algorithmic Recourse.

[BibT_eX]

[DOI]

Teresa Datta

Johannes van den Heuvel

Gjergji Kasneci

Proceedings of the Eleventh International Conference on Learning Representations, 2023

On the Privacy Risks of Algorithmic Recourse.

[BibT_eX]

[DOI]

Seth Neel

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022

Evaluating Explainability for Graph Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2022

TalkToModel: Understanding Machine Learning Models With Open Ended Dialogues.

[BibT_eX]

[DOI]

CoRR, 2022

Flatten the Curve: Efficiently Training Low-Curvature Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2022

A Human-Centric Take on Model Monitoring.

[BibT_eX]

[DOI]

Murtuza N. Shergadwala

CoRR, 2022

Rethinking Stability for Attribution-based Explanations.

[BibT_eX]

[DOI]

CoRR, 2022

Algorithmic Recourse in the Face of Noisy Human Responses.

[BibT_eX]

[DOI]

Teresa Datta

Johannes van den Heuvel

Gjergji Kasneci

CoRR, 2022

Rethinking Explainability as a Dialogue: A Practitioner's Perspective.

[BibT_eX]

[DOI]

CoRR, 2022

The Disagreement Problem in Explainable Machine Learning: A Practitioner's Perspective.

[BibT_eX]

[DOI]

CoRR, 2022

Data poisoning attacks on off-policy policy evaluation methods.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2022

Efficient Training of Low-Curvature Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Which Explanation Should I Choose? A Function Approximation Perspective to Characterizing Post Hoc Explanations.

[BibT_eX]

[DOI]

Tessa Han

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

OpenXAI: Towards a Transparent Evaluation of Model Explanations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Model Monitoring in Practice: Lessons Learned and Open Challenges.

[BibT_eX]

[DOI]

Pradeep Natarajan

Mehrnoosh Sameki

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

A Human-Centric Perspective on Model Monitoring.

[BibT_eX]

[DOI]

Murtuza N. Shergadwala

Proceedings of the Tenth AAAI Conference on Human Computation and Crowdsourcing, 2022

Exploring Counterfactual Explanations Through the Lens of Adversarial Examples: A Theoretical and Empirical Analysis.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Probing GNN Explainers: A Rigorous Theoretical and Empirical Analysis of GNN Explanation Methods.

[BibT_eX]

[DOI]

Marinka Zitnik

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Towards Robust Off-Policy Evaluation via Human Inputs.

[BibT_eX]

[DOI]

Proceedings of the AIES '22: AAAI/ACM Conference on AI, Ethics, and Society, Oxford, United Kingdom, May 19, 2022

Fairness via Explanation Quality: Evaluating Disparities in the Quality of Post hoc Explanations.

[BibT_eX]

[DOI]

Proceedings of the AIES '22: AAAI/ACM Conference on AI, Ethics, and Society, Oxford, United Kingdom, May 19, 2022

2021

What will it take to generate fairness-preserving explanations?

[BibT_eX]

[DOI]

CoRR, 2021

Feature Attributions and Counterfactual Explanations Can Be Manipulated.

[BibT_eX]

[DOI]

CoRR, 2021

On the Connections between Counterfactual Explanations and Adversarial Examples.

[BibT_eX]

[DOI]

CoRR, 2021

Towards a Rigorous Theoretical Analysis and Evaluation of GNN Explanations.

[BibT_eX]

[DOI]

Marinka Zitnik

CoRR, 2021

Counterfactual Explanations Can Be Manipulated.

[BibT_eX]

[DOI]

CoRR, 2021

Learning Under Adversarial and Interventional Shifts.

[BibT_eX]

[DOI]

CoRR, 2021

Towards a unified framework for fair and stable graph representation learning.

[BibT_eX]

[DOI]

Marinka Zitnik

Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Towards Robust and Reliable Algorithmic Recourse.

[BibT_eX]

[DOI]

Sohini Upadhyay

Shalmali Joshi

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Reliable Post hoc Explanations: Modeling Uncertainty in Explainability.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Counterfactual Explanations Can Be Manipulated.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Models for Actionable Recourse.

[BibT_eX]

[DOI]

Alexis Ross

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Towards the Unification and Robustness of Perturbation and Gradient Based Explanations.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Towards Reliable and Practicable Algorithmic Recourse.

[BibT_eX]

[DOI]

Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Does Fair Ranking Improve Minority Outcomes? Understanding the Interplay of Human and Algorithmic Biases in Online Hiring.

[BibT_eX]

[DOI]

Tom Sühr

Sophie Hilgard

Proceedings of the AIES '21: AAAI/ACM Conference on AI, 2021

Fair Influence Maximization: a Welfare Optimization Approach.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Can I Still Trust You?: Understanding the Impact of Distribution Shifts on Algorithmic Recourses.

[BibT_eX]

[DOI]

Kaivalya Rawal

Ece Kamar

CoRR, 2020

Ensuring Actionable Recourse via Adversarial Training.

[BibT_eX]

[DOI]

Alexis Ross

CoRR, 2020

Interpretable and Interactive Summaries of Actionable Recourses.

[BibT_eX]

[DOI]

Kaivalya Rawal

CoRR, 2020

How Much Should I Trust You? Modeling Uncertainty of Black Box Explanations.

[BibT_eX]

[DOI]

CoRR, 2020

Fair Influence Maximization: A Welfare Optimization Approach.

[BibT_eX]

[DOI]

CoRR, 2020

Incorporating Interpretable Output Constraints in Bayesian Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Beyond Individualized Recourse: Interpretable and Interactive Summaries of Actionable Recourses.

[BibT_eX]

[DOI]

Kaivalya Rawal

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Robust and Stable Black Box Explanations.

[BibT_eX]

[DOI]

Nino Arsov

Proceedings of the 37th International Conference on Machine Learning, 2020

Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods.

[BibT_eX]

[DOI]

Proceedings of the AIES '20: AAAI/ACM Conference on AI, 2020

"How do I fool you?": Manipulating User Trust via Misleading Black Box Explanations.

[BibT_eX]

[DOI]

Proceedings of the AIES '20: AAAI/ACM Conference on AI, 2020

2019

How can we fool LIME and SHAP? Adversarial Attacks on Post hoc Explanation Methods.

[BibT_eX]

[DOI]

CoRR, 2019

Faithful and Customizable Explanations of Black Box Models.

[BibT_eX]

[DOI]

Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 2019

2018

Human-centric machine learning: enabling machine learning for high-stakes decision-making.

[BibT_eX]

[DOI]

PhD thesis, 2018

2017

Interpretable & Explorable Approximations of Black Box Models.

[BibT_eX]

[DOI]

CoRR, 2017

The Selective Labels Problem: Evaluating Algorithmic Predictions in the Presence of Unobservables.

[BibT_eX]

[DOI]

Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Learning Cost-Effective and Interpretable Treatment Regimes.

[BibT_eX]

[DOI]

Cynthia Rudin

Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

Identifying Unknown Unknowns in the Open World: Representations and Policies for Guided Exploration.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Psycho-Demographic Analysis of the Facebook Rainbow Campaign.

[BibT_eX]

[DOI]

CoRR, 2016

Learning Cost-Effective Treatment Regimes using Markov Decision Processes.

[BibT_eX]

[DOI]

Cynthia Rudin

CoRR, 2016

Discovering Blind Spots of Predictive Models: Representations and Policies for Guided Exploration.

[BibT_eX]

[DOI]

CoRR, 2016

Confusions over Time: An Interpretable Bayesian Model to Characterize Trends in Decision Making.

[BibT_eX]

[DOI]

Jure Leskovec

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Interpretable Decision Sets: A Joint Framework for Description and Prediction.

[BibT_eX]

[DOI]

Stephen H. Bach

Jure Leskovec

Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

2015

A Bayesian Framework for Modeling Human Evaluations.

[BibT_eX]

[DOI]

Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

Who, when, and why: a machine learning approach to prioritizing students at risk of not graduating high school on time.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Learning Analytics And Knowledge, 2015

A Machine Learning Framework to Identify Students at Risk of Adverse Academic Outcomes.

[BibT_eX]

[DOI]

Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

2013

What's in a Name? Understanding the Interplay between Titles, Content, and Communities in Social Media.

[BibT_eX]

[DOI]

Julian J. McAuley

Jure Leskovec

Proceedings of the Seventh International Conference on Weblogs and Social Media, 2013

2012

TEM: a novel perspective to modeling content onmicroblogs.

[BibT_eX]

[DOI]

Hyung-Il Ahn

Proceedings of the 21st World Wide Web Conference, 2012

Dynamic Multi-relational Chinese Restaurant Process for Analyzing Influences on Users in Social Media.

[BibT_eX]

[DOI]

Indrajit Bhattacharya

Chiranjib Bhattacharyya

Proceedings of the 12th IEEE International Conference on Data Mining, 2012

2011

Smart news feeds for social networks using scalable joint latent factor models.

[BibT_eX]

[DOI]

Angshu Rai

Srujana Merugu

Proceedings of the 20th International Conference on World Wide Web, 2011

Exploiting Coherence for the Simultaneous Discovery of Latent Facets and associated Sentiments.

[BibT_eX]

[DOI]

Chiranjib Bhattacharyya

Indrajit Bhattacharya

Srujana Merugu

Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

Attention prediction on social media brand pages.

[BibT_eX]

[DOI]