We present a dataset for evaluating the grammatical sophistication of language models (LMs). We construct a large number of minimal pairs illustrating constraints on subject-verb agreement, reflexive anaphora and negative polarity items, in several English constructions; we expect LMs to assign a higher probability to the grammatical member of each minimal pair. An LSTM LM performed poorly in many cases. Multi-task training with a syntactic objective improved the LSTM’s accuracy, which nevertheless remained far lower than the accuracy of human participants. This suggests that there is considerable room for improvement over LSTMs in capturing syntax in an LM
We consider the extent to which different deep neural network (DNN) configurations can learn syntact...
How cross-linguistically applicable are NLP models, specifically language models? A fair comparison ...
Natural Language is highly ambiguous, on every level. This article describes a fast broad-coverage s...
We introduce BLiMP (The Benchmark of Linguistic Minimal Pairs), a human-solvable challenge set for e...
We introduce The Benchmark of Linguistic Minimal Pairs (BLiMP),1 a challenge set for evaluating the ...
The binary nature of grammaticality judgments and their use to access the structure of syntax are a ...
The binary nature of grammaticality judgments and their use to access the structure of syntax are a ...
The binary nature of grammaticality judgments and their use to access the structure of syntax are a ...
LSTMs have proven very successful at language modeling. However, it remains unclear to what extent t...
Grammar-based natural language processing has reached a level where it can `understand' language to ...
Most acceptability judgments reported in the syntactic literature are obtained by linguists being th...
Juzek TS, Häussler J. Data convergence in syntactic theory and the role of sentence pairs. Zeitschri...
© 2019 Association for Computational Linguistics State-of-the-art LSTM language models trained on la...
An emerging line of work uses psycholinguistic methods to evaluate the syntactic generalizations acq...
© 2019 Association for Computational Linguistics State-of-the-art LSTM language models trained on la...
We consider the extent to which different deep neural network (DNN) configurations can learn syntact...
How cross-linguistically applicable are NLP models, specifically language models? A fair comparison ...
Natural Language is highly ambiguous, on every level. This article describes a fast broad-coverage s...
We introduce BLiMP (The Benchmark of Linguistic Minimal Pairs), a human-solvable challenge set for e...
We introduce The Benchmark of Linguistic Minimal Pairs (BLiMP),1 a challenge set for evaluating the ...
The binary nature of grammaticality judgments and their use to access the structure of syntax are a ...
The binary nature of grammaticality judgments and their use to access the structure of syntax are a ...
The binary nature of grammaticality judgments and their use to access the structure of syntax are a ...
LSTMs have proven very successful at language modeling. However, it remains unclear to what extent t...
Grammar-based natural language processing has reached a level where it can `understand' language to ...
Most acceptability judgments reported in the syntactic literature are obtained by linguists being th...
Juzek TS, Häussler J. Data convergence in syntactic theory and the role of sentence pairs. Zeitschri...
© 2019 Association for Computational Linguistics State-of-the-art LSTM language models trained on la...
An emerging line of work uses psycholinguistic methods to evaluate the syntactic generalizations acq...
© 2019 Association for Computational Linguistics State-of-the-art LSTM language models trained on la...
We consider the extent to which different deep neural network (DNN) configurations can learn syntact...
How cross-linguistically applicable are NLP models, specifically language models? A fair comparison ...
Natural Language is highly ambiguous, on every level. This article describes a fast broad-coverage s...