The lack of large and reliable datasets has been hindering progress in Text Simplification (TS). We investigate the application of the recently created Newsela corpus, the largest collection of professionally written simplifications available, in TS tasks. Using new alignment algorithms, we extract 550,644 complex-simple sentence pairs from the corpus. This data is explored in different ways: (i) we show that traditional readability metrics capture surprisingly well the different complexity levels in this corpus, (ii) we build machine learning models to classify sentences into complex vs. simple and to predict complexity levels that outperform their respective baselines, (iii) we introduce a lexical simplifier that uses the corpus to genera...
We propose a new method for evaluating the readability of simplified sentences through pair-wise ran...
Current Automatic Text Simplification (TS) work relies on sequence-to-sequence neural models that le...
In order to simplify a sentence, human editors perform multiple rewriting transformations: they spli...
Current research in text simplification has been hampered by two central problems: (i) the small amo...
We provide several methods for sentence alignment of texts with different complexity levels. Using t...
We provide several methods for sentence alignment of texts with different complexity levels. Using t...
We provide several methods for sentence alignment of texts with different complexity levels. Using t...
We provide several methods for sentence alignment of texts with different complexity levels. Using t...
We provide several methods for sentence alignment of texts with different complexity levels. Using t...
While there is a vast amount of text written about nearly any topic, this is often difficult for som...
Many texts we encounter in our everyday lives are lexically and syntactically very complex. This m...
Many texts we encounter in our everyday lives are lexically and syntactically very complex. This m...
Sentence Simplification (SS) aims to modify a sentence in order to make it easier to read and unders...
Text simplification (TS), defined narrowly, is the process of reducing the linguistic complexity of ...
Sentence Simplification (SS) aims to modify a sentence in order to make it easier to read and unders...
We propose a new method for evaluating the readability of simplified sentences through pair-wise ran...
Current Automatic Text Simplification (TS) work relies on sequence-to-sequence neural models that le...
In order to simplify a sentence, human editors perform multiple rewriting transformations: they spli...
Current research in text simplification has been hampered by two central problems: (i) the small amo...
We provide several methods for sentence alignment of texts with different complexity levels. Using t...
We provide several methods for sentence alignment of texts with different complexity levels. Using t...
We provide several methods for sentence alignment of texts with different complexity levels. Using t...
We provide several methods for sentence alignment of texts with different complexity levels. Using t...
We provide several methods for sentence alignment of texts with different complexity levels. Using t...
While there is a vast amount of text written about nearly any topic, this is often difficult for som...
Many texts we encounter in our everyday lives are lexically and syntactically very complex. This m...
Many texts we encounter in our everyday lives are lexically and syntactically very complex. This m...
Sentence Simplification (SS) aims to modify a sentence in order to make it easier to read and unders...
Text simplification (TS), defined narrowly, is the process of reducing the linguistic complexity of ...
Sentence Simplification (SS) aims to modify a sentence in order to make it easier to read and unders...
We propose a new method for evaluating the readability of simplified sentences through pair-wise ran...
Current Automatic Text Simplification (TS) work relies on sequence-to-sequence neural models that le...
In order to simplify a sentence, human editors perform multiple rewriting transformations: they spli...