Although attention mechanisms have been applied to a variety of deep learning models and have been shown to improve the prediction performance, it has been reported to be vulnerable to perturbations to the mechanism. To overcome the vulnerability to perturbations in the mechanism, we are inspired by adversarial training (AT), which is a powerful regularization technique for enhancing the robustness of the models. In this paper, we propose a general training technique for natural language processing tasks, including AT for attention (Attention AT) and more interpretable AT for attention (Attention iAT). The proposed techniques improved the prediction performance and the model interpretability by exploiting the mechanisms with AT. In particul...
Image classification systems are known to be vulnerable to adversarial attacks, which are impercepti...
Ochanomizu UniversityJohn Hopkins UniversityKyoto UniversityNational Institute for Japanese Language...
The monumental achievements of deep learning (DL) systems seem to guarantee the absolute superiority...
Although attention mechanisms have been applied to a variety of deep learning models and have been s...
Although attention mechanisms have become fundamental components of deep learning models, they are v...
With the dramatic advances in deep learning technology, machine learning research is focusing on imp...
Deep neural models (e.g. Transformer) naturally learn spurious features, which create a ``shortcut''...
Neural language models show vulnerability to adversarial examples which are semantically similar to ...
International audienceAttention mechanism is contributing to the majority of recent advances in mach...
Attention mechanism has become a standard fixture in many state-of-the-art natural language processi...
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their tr...
Deep neural networks are known to be vulnerable to adversarial attacks. The empirical analysis in ou...
In this paper, we introduce a new approach to Reinforcement Learning (RL) called “supervised attenti...
Learning attention functions requires large volumes of data, but many NLP tasks simulate human behav...
Attention is an increasingly popular mechanism used in a wide range of neural architectures. The mec...
Image classification systems are known to be vulnerable to adversarial attacks, which are impercepti...
Ochanomizu UniversityJohn Hopkins UniversityKyoto UniversityNational Institute for Japanese Language...
The monumental achievements of deep learning (DL) systems seem to guarantee the absolute superiority...
Although attention mechanisms have been applied to a variety of deep learning models and have been s...
Although attention mechanisms have become fundamental components of deep learning models, they are v...
With the dramatic advances in deep learning technology, machine learning research is focusing on imp...
Deep neural models (e.g. Transformer) naturally learn spurious features, which create a ``shortcut''...
Neural language models show vulnerability to adversarial examples which are semantically similar to ...
International audienceAttention mechanism is contributing to the majority of recent advances in mach...
Attention mechanism has become a standard fixture in many state-of-the-art natural language processi...
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their tr...
Deep neural networks are known to be vulnerable to adversarial attacks. The empirical analysis in ou...
In this paper, we introduce a new approach to Reinforcement Learning (RL) called “supervised attenti...
Learning attention functions requires large volumes of data, but many NLP tasks simulate human behav...
Attention is an increasingly popular mechanism used in a wide range of neural architectures. The mec...
Image classification systems are known to be vulnerable to adversarial attacks, which are impercepti...
Ochanomizu UniversityJohn Hopkins UniversityKyoto UniversityNational Institute for Japanese Language...
The monumental achievements of deep learning (DL) systems seem to guarantee the absolute superiority...