Many believe human-level natural language inference (NLI) has already been achieved. In reality, modern NLI benchmarks have serious flaws, rendering progress questionable. Chief among them is the problem of single sentence label leakage, where spurious correlations and biases in datasets enable the accurate prediction of a sentence pair relation from only a single sentence, something that should in principle be impossible. This leakage enables models to cheat rather than learn the desired reasoning capabilities, and hasn't gone away since its 2018 discovery. We analyze this problem across 10 modern NLI datasets, and find that new datasets have a single sentence accuracy of 8% over chance at best and 19% on average. We examine how regular NL...
We investigate how disagreement in natural language inference (NLI) annotation arises. We developed ...
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their tr...
Natural Language Inference (NLI) research involves the development of models that can mimic human in...
A central question in natural language understanding (NLU) research is whether high performance demo...
Natural Language Inference (NLI) plays an important role in many natural language processing tasks s...
Nature language inference (NLI) task is a predictive task of determining the inference relationship ...
Natural Language Inference (NLI) has been extensively studied by the NLP community as a framework fo...
Natural Language Inference (NLI) datasets contain annotation artefacts resulting in spurious correla...
© 2019 Association for Computational Linguistics The task of Natural Language Inference (NLI) is wid...
We present a large-scale collection of diverse natural language inference (NLI) datasets that help p...
Do state-of-the-art models for language understanding already have, or can they easily learn, abilit...
Success in natural language inference (NLI) should require a model to understand both lexical and co...
Natural Language Inference is a challenging task that has received substantial attention, and state-...
Natural Language Inference is a challenging task that has received substantial attention, and state-...
Natural language understanding (NLU) models tend to rely on spurious correlations (i.e., dataset bia...
We investigate how disagreement in natural language inference (NLI) annotation arises. We developed ...
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their tr...
Natural Language Inference (NLI) research involves the development of models that can mimic human in...
A central question in natural language understanding (NLU) research is whether high performance demo...
Natural Language Inference (NLI) plays an important role in many natural language processing tasks s...
Nature language inference (NLI) task is a predictive task of determining the inference relationship ...
Natural Language Inference (NLI) has been extensively studied by the NLP community as a framework fo...
Natural Language Inference (NLI) datasets contain annotation artefacts resulting in spurious correla...
© 2019 Association for Computational Linguistics The task of Natural Language Inference (NLI) is wid...
We present a large-scale collection of diverse natural language inference (NLI) datasets that help p...
Do state-of-the-art models for language understanding already have, or can they easily learn, abilit...
Success in natural language inference (NLI) should require a model to understand both lexical and co...
Natural Language Inference is a challenging task that has received substantial attention, and state-...
Natural Language Inference is a challenging task that has received substantial attention, and state-...
Natural language understanding (NLU) models tend to rely on spurious correlations (i.e., dataset bia...
We investigate how disagreement in natural language inference (NLI) annotation arises. We developed ...
Natural Language Inference (NLI) models are known to learn from biases and artefacts within their tr...
Natural Language Inference (NLI) research involves the development of models that can mimic human in...