Been Kim

ORCID: 0000-0001-9938-2915

Affiliations:
  • Google, USA
  • AI2, Allen Institute for Artificial Intelligence, Seattle, USA


According to our database, Been Kim authored at least 65 papers between 2010 and 2023.


Bibliography

2023
Bridging the Human-AI Knowledge Gap: Concept Discovery and Transfer in AlphaZero.
CoRR, 2023

Getting aligned on representational alignment.
CoRR, 2023

Don't trust your eyes: on the (un)reliability of feature visualizations.
CoRR, 2023

Model evaluation for extreme risks.
CoRR, 2023

Gaussian Process Probes (GPP) for Uncertainty-Aware Probing.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Subgoal-Based Explanations for Unreliable Intelligent Decision Support Systems.
Proceedings of the 28th International Conference on Intelligent User Interfaces, 2023

On the Relationship Between Explanation and Prediction: A Causal View.
Proceedings of the 40th International Conference on Machine Learning, 2023

2022
Impossibility Theorems for Feature Attribution.
CoRR, 2022

Human-Centered Concept Explanations for Neural Networks.
CoRR, 2022

Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DISSECT: Disentangled Simultaneous Explanations via Concept Traversals.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Post hoc Explanations may be Ineffective for Detecting Unknown Spurious Correlation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Human-Centered Concept Explanations for Neural Networks.
Proceedings of the Neuro-Symbolic Artificial Intelligence: The State of the Art, 2021

Explainable deep learning for efficient and robust pattern recognition: A survey of recent developments.
Pattern Recognit., 2021

Analyzing a Caching Model.
CoRR, 2021

Acquisition of Chess Knowledge in AlphaZero.
CoRR, 2021

Best of both worlds: local and global explanations with human-understandable concepts.
CoRR, 2021

Machine Learning Techniques for Accountability.
AI Mag., 2021

2020
On Completeness-aware Concept-Based Explanations in Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Debugging Tests for Model Explanations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Concept Bottleneck Models.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
The (Un)reliability of Saliency Methods.
Proceedings of the Explainable AI: Interpreting, Explaining and Visualizing Deep Learning, 2019

On Concept-Based Explanations in Deep Neural Networks.
CoRR, 2019

BIM: Towards Quantitative Evaluation of Interpretability Methods with Ground Truth.
CoRR, 2019

Towards Realistic Individual Recourse and Actionable Explanations in Black-Box Decision Making Systems.
CoRR, 2019

Explaining Classifiers with Causal Concept Effect (CaCE).
CoRR, 2019

Do Neural Networks Show Gestalt Phenomena? An Exploration of the Law of Closure.
CoRR, 2019

Automating Interpretability: Discovering and Testing Visual Concepts Learned by Neural Networks.
CoRR, 2019

An Evaluation of the Human-Interpretability of Explanation.
CoRR, 2019

Visualizing and Measuring the Geometry of BERT.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Benchmark for Interpretability Methods in Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Towards Automatic Concept-based Explanations.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Human Evaluation of Models Built for Interpretability.
Proceedings of the Seventh AAAI Conference on Human Computation and Crowdsourcing, 2019

Human-Centered Tools for Coping with Imperfect Algorithms During Medical Decision-Making.
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

Interpreting Black Box Predictions using Fisher Kernels.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Proceedings of the 2018 ICML Workshop on Human Interpretability in Machine Learning (WHI 2018).
CoRR, 2018

Evaluating Feature Importance Estimates.
CoRR, 2018

xGEMs: Generating Examplars to Explain Black-Box Models.
CoRR, 2018

To Trust Or Not To Trust A Classifier.
CoRR, 2018

How do Humans Understand Explanations from Machine Learning Systems? An Evaluation of the Human-Interpretability of Explanation.
CoRR, 2018

Human-in-the-Loop Interpretability Prior.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

To Trust Or Not To Trust A Classifier.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Sanity Checks for Saliency Maps.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV).
Proceedings of the 35th International Conference on Machine Learning, 2018

Learning how to explain neural networks: PatternNet and PatternAttribution.
Proceedings of the 6th International Conference on Learning Representations, 2018

Local Explanation Methods for Deep Neural Networks Lack Sensitivity to Parameter Values.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
The (Un)reliability of saliency methods.
CoRR, 2017

Proceedings of the 2017 ICML Workshop on Human Interpretability in Machine Learning (WHI 2017).
CoRR, 2017

SmoothGrad: removing noise by adding noise.
CoRR, 2017

A Roadmap for a Rigorous Science of Interpretability.
CoRR, 2017

QSAnglyzer: Visual Analytics for Prismatic Analysis of Question Answering System Evaluations.
Proceedings of the 12th IEEE Conference on Visual Analytics Science and Technology, 2017

2016
Proceedings of the 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016).
CoRR, 2016

Examples are not enough, learn to criticize! Criticism for Interpretability.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015
Inferring Team Task Plans from Human Meetings: A Generative Modeling Approach with Logic-Based Prior.
J. Artif. Intell. Res., 2015

Mind the Gap: A Generative Approach to Interpretable Feature Selection and Extraction.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Scalable and Interpretable Data Representation for High-Dimensional, Complex Data.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Learning about meetings.
Data Min. Knowl. Discov., 2014

The Bayesian Case Model: A Generative Approach for Case-Based Reasoning and Prototype Classification.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013
Quantitative estimation of the strength of agreements in goal-oriented meetings.
Proceedings of the IEEE International Multi-Disciplinary Conference on Cognitive Methods in Situation Awareness and Decision Support, 2013

Machine Learning for Meeting Analysis.
Proceedings of the Late-Breaking Developments in the Field of Artificial Intelligence, 2013

Inferring Robot Task Plans from Human Team Meetings: A Generative Modeling Approach with Logic-Based Prior.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Human-Inspired Techniques for Human-Machine Team Planning.
Proceedings of the Human Control of Bioinspired Swarms, 2012

2010
Multiple relative pose graphs for robust cooperative mapping.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

