This paper presents TextTiling, a method for partitioning full-length text documents into coherent multiparagraph units. The layout of text tiles is meant to reflect the pattern of subtopics contained in an expository text. The approach uses lexical analyses based on tf.idf, an information retrieval measurement, to determine the extent of the tiles, incorporating thesaural information via a statistical disambiguation algorithm. The tiles have been found to correspond well to human judgements of the major subtopic boundaries of science magazine articles
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
A methodology for automatically summarising scientific texts is presented using the patterns of lexi...
Text tiling aims to split long documents into multiple related paragraphs. In this study, the docume...
The aim of the research presented here is to report on a corpus-based method for discourse analysis ...
This paper introduces a new statistical approach to partitioning text automatically into coherent se...
It is well known that some techniques have already been developed to automatically subdivide texts i...
. This paper introduces a new statistical approach to automatically partitioning text into coherent ...
International audienceThe automatic text segmentation task consists of identifying the most importan...
Discourse segmentation is the division of a text into minimal discourse segments, which form the lea...
Abstract. We present divSeg, a novel method for text segmentation that iteratively splits a portion ...
We propose integrating features from lexical cohesion with elements from lay-out recognition to buil...
International audienceThe thematic text segmentation task consists in identifying the most important...
In order to uncover the structure of a discourse or to validate some hypotheses about it, researcher...
The thematic text segmentation task consists in identifying the most important thematic breaks in a ...
We propose integrating features from lexical cohesion with elements from layout recognition to build...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
A methodology for automatically summarising scientific texts is presented using the patterns of lexi...
Text tiling aims to split long documents into multiple related paragraphs. In this study, the docume...
The aim of the research presented here is to report on a corpus-based method for discourse analysis ...
This paper introduces a new statistical approach to partitioning text automatically into coherent se...
It is well known that some techniques have already been developed to automatically subdivide texts i...
. This paper introduces a new statistical approach to automatically partitioning text into coherent ...
International audienceThe automatic text segmentation task consists of identifying the most importan...
Discourse segmentation is the division of a text into minimal discourse segments, which form the lea...
Abstract. We present divSeg, a novel method for text segmentation that iteratively splits a portion ...
We propose integrating features from lexical cohesion with elements from lay-out recognition to buil...
International audienceThe thematic text segmentation task consists in identifying the most important...
In order to uncover the structure of a discourse or to validate some hypotheses about it, researcher...
The thematic text segmentation task consists in identifying the most important thematic breaks in a ...
We propose integrating features from lexical cohesion with elements from layout recognition to build...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
A methodology for automatically summarising scientific texts is presented using the patterns of lexi...
Text tiling aims to split long documents into multiple related paragraphs. In this study, the docume...