International audienceThe automatic text segmentation task consists of identifying the most important thematic breaks in a document in order to cut it into homogeneous passages. Text segmentation has motivated a large amount of research. We focus here on the statistical approaches that rely on an analysis of the distribution of the words in the text. Usually, the segmentation of texts is realized sequentially on the basis of very local clues. However, such an approach prevents the consideration of the text in a global way, particularly concerning the granularity degree adopted for the expression of the different topics it addresses. We thus propose here two new segmentation algorithms—ClassStruggle and SegGen—which use criteria rendering gl...
This paper presents TextTiling, a method for partitioning full-length text documents into coherent m...
Segmentation has been used in different natural language processing tasks, such as information retri...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
The automatic text segmentation task consists of identifying the most important thematic breaks in a...
International audienceThe thematic text segmentation task consists in identifying the most important...
The thematic text segmentation task consists in identifying the most important thematic breaks in a ...
Automatic text segmentation, which is the task of breaking a text into topically-consistent segments...
. This paper introduces a new statistical approach to automatically partitioning text into coherent ...
This paper introduces a new statistical approach to partitioning text automatically into coherent se...
Abstract. We present divSeg, a novel method for text segmentation that iteratively splits a portion ...
International audienceSegGen [1] is a linear thematic segmentation algorithm grounded on a variant o...
The current approaches to the analysis of natural language text are not viable for documents of unr...
Automatic document segmentation gets more and more attention in the natural language processing fiel...
We introduce a novel unsupervised algorithm for text segmentation. We re-conceptualize text segmenta...
In this paper we introduce a machine learning approach for automatic text segmentation
This paper presents TextTiling, a method for partitioning full-length text documents into coherent m...
Segmentation has been used in different natural language processing tasks, such as information retri...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
The automatic text segmentation task consists of identifying the most important thematic breaks in a...
International audienceThe thematic text segmentation task consists in identifying the most important...
The thematic text segmentation task consists in identifying the most important thematic breaks in a ...
Automatic text segmentation, which is the task of breaking a text into topically-consistent segments...
. This paper introduces a new statistical approach to automatically partitioning text into coherent ...
This paper introduces a new statistical approach to partitioning text automatically into coherent se...
Abstract. We present divSeg, a novel method for text segmentation that iteratively splits a portion ...
International audienceSegGen [1] is a linear thematic segmentation algorithm grounded on a variant o...
The current approaches to the analysis of natural language text are not viable for documents of unr...
Automatic document segmentation gets more and more attention in the natural language processing fiel...
We introduce a novel unsupervised algorithm for text segmentation. We re-conceptualize text segmenta...
In this paper we introduce a machine learning approach for automatic text segmentation
This paper presents TextTiling, a method for partitioning full-length text documents into coherent m...
Segmentation has been used in different natural language processing tasks, such as information retri...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...