Machine-learning models have demonstrated great success in learning complex patterns that enable them to make predictions about unobserved data. In addition to using models for prediction, the ability to interpret what a model has learned is receiving an increasing amount of attention. These interpretations have found a number of uses, ranging from providing scientific insight to auditing the predictions themselves to ensure fairness with respect to protected categories like race or gender. However, there is still considerable confusion about the notion of interpretability. In particular, it is currently unclear both what it means to interpret a ML model, and how to actually do so.In the first part of this thesis, we address the foundationa...
In this review, we examine the problem of designing interpretable and explainable machine learning m...
In this thesis we investigate different interpretability methods for evaluating predictions from Con...
Deep neural networks (DNNs) has attracted much attention in machine learning community due to its st...
Machine-learning models have demonstrated great success in learning complex patterns that enable the...
The recent surge in highly successful, but opaque, machine-learning models has given rise to a dire ...
Deep neural networks have achieved near-human accuracy levels in various types of classification and...
As the use of deep learning techniques has grown across various fields over the past decade, complai...
Machine-learning models have demonstrated great success in learning complex patterns that enable the...
Machine-learning models have demonstrated great success in learning complex patterns that enable the...
Machine-learning models have demonstrated great success in learning complex patterns that enable the...
Despite the high accuracy offered by state-of-the-art deep natural-language models (e.g., LSTM, BERT...
Abstract. Data of different levels of complexity and of ever growing diversity of characteristics ar...
Recent severe failures of black box models in high stakes decisions have increased interest in inter...
10 pages, 22 figures, submitted to ICLR 2023A wide variety of model explanation approaches have been...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In this review, we examine the problem of designing interpretable and explainable machine learning m...
In this thesis we investigate different interpretability methods for evaluating predictions from Con...
Deep neural networks (DNNs) has attracted much attention in machine learning community due to its st...
Machine-learning models have demonstrated great success in learning complex patterns that enable the...
The recent surge in highly successful, but opaque, machine-learning models has given rise to a dire ...
Deep neural networks have achieved near-human accuracy levels in various types of classification and...
As the use of deep learning techniques has grown across various fields over the past decade, complai...
Machine-learning models have demonstrated great success in learning complex patterns that enable the...
Machine-learning models have demonstrated great success in learning complex patterns that enable the...
Machine-learning models have demonstrated great success in learning complex patterns that enable the...
Despite the high accuracy offered by state-of-the-art deep natural-language models (e.g., LSTM, BERT...
Abstract. Data of different levels of complexity and of ever growing diversity of characteristics ar...
Recent severe failures of black box models in high stakes decisions have increased interest in inter...
10 pages, 22 figures, submitted to ICLR 2023A wide variety of model explanation approaches have been...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In this review, we examine the problem of designing interpretable and explainable machine learning m...
In this thesis we investigate different interpretability methods for evaluating predictions from Con...
Deep neural networks (DNNs) has attracted much attention in machine learning community due to its st...