© 2018 Curran Associates Inc.All rights reserved. Most recent work on interpretability of complex machine learning models has focused on estimating a posteriori explanations for previously trained models around specific predictions. Self-explaining models where interpretability plays a key role already during learning have received much less attention. We propose three desiderata for explanations in general - explicitness, faithfulness, and stability - and show that existing methods do not satisfy them. In response, we design self-explaining models in stages, progressively generalizing linear classifiers to complex yet architecturally explicit models. Faithfulness and stability are enforced via regularization specifically tailored to such m...
Modern machine learning methods allow for complex and in-depth analytics, but the predictive models ...
Explainable models in machine learning are increas- ingly popular due to the interpretability-favori...
In the field of neural networks, there has been a long-standing problem that needs to be addressed: ...
© 2018 Curran Associates Inc.All rights reserved. Most recent work on interpretability of complex ma...
10 pages, 22 figures, submitted to ICLR 2023A wide variety of model explanation approaches have been...
Self-explainable deep neural networks are a recent class of models that can output ante-hoc local ex...
In this review, we examine the problem of designing interpretable and explainable machine learning m...
We take inspiration from the study of human explanation to inform the design and evaluation of inter...
Explanations shed light on a machine learning model's rationales and can aid in identifying deficien...
Safety-critical applications (e.g., autonomous vehicles, human-machine teaming, and automated medica...
In recent decades, artificial intelligence (AI) systems are becoming increasingly ubiquitous from lo...
36 pages, including 9 pages of main text. This is the author version of the NeurIPS'2020 paper. Exce...
End-to-end neural NLP architectures are notoriously difficult to understand, which gives rise to num...
Machine-learning models have demonstrated great success in learning complex patterns that enable the...
Recent work on interpretability in machine learning and AI has focused on the building of simplified...
Modern machine learning methods allow for complex and in-depth analytics, but the predictive models ...
Explainable models in machine learning are increas- ingly popular due to the interpretability-favori...
In the field of neural networks, there has been a long-standing problem that needs to be addressed: ...
© 2018 Curran Associates Inc.All rights reserved. Most recent work on interpretability of complex ma...
10 pages, 22 figures, submitted to ICLR 2023A wide variety of model explanation approaches have been...
Self-explainable deep neural networks are a recent class of models that can output ante-hoc local ex...
In this review, we examine the problem of designing interpretable and explainable machine learning m...
We take inspiration from the study of human explanation to inform the design and evaluation of inter...
Explanations shed light on a machine learning model's rationales and can aid in identifying deficien...
Safety-critical applications (e.g., autonomous vehicles, human-machine teaming, and automated medica...
In recent decades, artificial intelligence (AI) systems are becoming increasingly ubiquitous from lo...
36 pages, including 9 pages of main text. This is the author version of the NeurIPS'2020 paper. Exce...
End-to-end neural NLP architectures are notoriously difficult to understand, which gives rise to num...
Machine-learning models have demonstrated great success in learning complex patterns that enable the...
Recent work on interpretability in machine learning and AI has focused on the building of simplified...
Modern machine learning methods allow for complex and in-depth analytics, but the predictive models ...
Explainable models in machine learning are increas- ingly popular due to the interpretability-favori...
In the field of neural networks, there has been a long-standing problem that needs to be addressed: ...