International audienceDiscourse segmentation is the first step in building discourse parsers. Most work on discourse segmentation does not scale to real-world discourse parsing across languages , for two reasons: (i) models rely on constituent trees, and (ii) experiments have relied on gold standard identification of sentence and token boundaries. We therefore investigate to what extent constituents can be replaced with universal dependencies , or left out completely, as well as how state-of-the-art segmenters fare in the absence of sentence boundaries. Our results show that dependency information is less useful than expected, but we provide a fully scalable, robust model that only relies on part-of-speech information, and show that it perf...
International audienceSentence splitting involves the segmentation of a sentence into two or more sh...
A discourse constitutes a locally and globally coherent text in which words, clauses and sentences a...
Several discourse annotated corpora now exist for NLP exploitation. Nevertheless, it is not clear ho...
International audienceDiscourse segmentation is a crucial step in building end-to-end discourse pars...
International audienceSegmentation is the first step in building practical discourse parsers, and is...
Recent neural supervised topic segmentation models achieve distinguished superior effectiveness over...
International audienceThis paper presents a novel approach to document-based discourse analysis by p...
International audienceDiscourse segmentation, the first step of discourse analysis, has been shown t...
International audienceWhile discourse segmentation and parsing has made considerable progress in rec...
International audienceIn this article, we propose the first work that investigates the feasibility o...
International audienceFinding word boundaries in continuous speech is challenging as there is little...
International audienceAutomatically detecting discourse segments is an important preliminary step to...
The need to model the relation between discourse structure and linguistic features of ut-terances is...
Computational text-level discourse analysis mostly happens within Rhetorical Structure Theory (RST),...
RST-style discourse parsing plays a vital role in many NLP tasks, revealing the underlying semantic/...
International audienceSentence splitting involves the segmentation of a sentence into two or more sh...
A discourse constitutes a locally and globally coherent text in which words, clauses and sentences a...
Several discourse annotated corpora now exist for NLP exploitation. Nevertheless, it is not clear ho...
International audienceDiscourse segmentation is a crucial step in building end-to-end discourse pars...
International audienceSegmentation is the first step in building practical discourse parsers, and is...
Recent neural supervised topic segmentation models achieve distinguished superior effectiveness over...
International audienceThis paper presents a novel approach to document-based discourse analysis by p...
International audienceDiscourse segmentation, the first step of discourse analysis, has been shown t...
International audienceWhile discourse segmentation and parsing has made considerable progress in rec...
International audienceIn this article, we propose the first work that investigates the feasibility o...
International audienceFinding word boundaries in continuous speech is challenging as there is little...
International audienceAutomatically detecting discourse segments is an important preliminary step to...
The need to model the relation between discourse structure and linguistic features of ut-terances is...
Computational text-level discourse analysis mostly happens within Rhetorical Structure Theory (RST),...
RST-style discourse parsing plays a vital role in many NLP tasks, revealing the underlying semantic/...
International audienceSentence splitting involves the segmentation of a sentence into two or more sh...
A discourse constitutes a locally and globally coherent text in which words, clauses and sentences a...
Several discourse annotated corpora now exist for NLP exploitation. Nevertheless, it is not clear ho...