We build on abduction-based explanations for machine learning and develop a method for computing local explanations for neural network models in natural language processing (NLP). Our explanations comprise a subset of the words of the input text that satisfies two key features: optimality w.r.t. a user-defined cost function, such as the length of explanation, and robustness, in that they ensure prediction invariance for any bounded perturbation in the embedding space of the left-out words. We present two solution algorithms, respectively based on implicit hitting sets and maximum universal subsets, introducing a number of algorithmic improvements to speed up convergence of hard instances. We show how our method can be configured with differ...
The behavior of deep neural networks (DNNs) is hard to understand. This makes it necessary to explor...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
We propose an approach to faithfully explaining text classification models, using a specifically des...
Despite the high accuracy offered by state-of-the-art deep natural-language models (e.g., LSTM, BERT...
Explainability of machine learning models is increasing in importance. The reason for this is that t...
The growing range of applications of Machine Learning (ML) in a multitude of settings motivates the ...
Neural network NLP models are vulnerable to small modifications of the input that maintain the origi...
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their tr...
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their tr...
Deep learning, and in particular neural networks (NNs), have seen a surge in popularity over the pas...
Deep Neural Networks such as Recurrent Neural Networks and Transformer models are widely adopted for...
A growing line of work has investigated the development of neural NLP models that can produce ration...
The behavior of deep neural networks (DNNs) is hard to understand. This makes it necessary to explor...
Explaining the decisions of a Deep Learning Network is imperative to safeguard end-user trust. Such ...
The behavior of deep neural networks (DNNs) is hard to understand. This makes it necessary to explor...
The behavior of deep neural networks (DNNs) is hard to understand. This makes it necessary to explor...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
We propose an approach to faithfully explaining text classification models, using a specifically des...
Despite the high accuracy offered by state-of-the-art deep natural-language models (e.g., LSTM, BERT...
Explainability of machine learning models is increasing in importance. The reason for this is that t...
The growing range of applications of Machine Learning (ML) in a multitude of settings motivates the ...
Neural network NLP models are vulnerable to small modifications of the input that maintain the origi...
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their tr...
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their tr...
Deep learning, and in particular neural networks (NNs), have seen a surge in popularity over the pas...
Deep Neural Networks such as Recurrent Neural Networks and Transformer models are widely adopted for...
A growing line of work has investigated the development of neural NLP models that can produce ration...
The behavior of deep neural networks (DNNs) is hard to understand. This makes it necessary to explor...
Explaining the decisions of a Deep Learning Network is imperative to safeguard end-user trust. Such ...
The behavior of deep neural networks (DNNs) is hard to understand. This makes it necessary to explor...
The behavior of deep neural networks (DNNs) is hard to understand. This makes it necessary to explor...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
We propose an approach to faithfully explaining text classification models, using a specifically des...