In this paper we propose a course-grained NLP approach to text segmentation based on the\ud analysis of lexical cohesion within text. Most work in this area has focused on the discovery of textual\ud units that discuss subtopic structure within documents. In contrast our segmentation task requires the discovery of topical units of text i.e. distinct news stories from broadcast news programmes. Our system SeLeCT first builds a set of lexical chains, in order to model the discourse structure of the text. A boundary detector is then used to search for breaking points in this structure indicated by patterns of cohesive strength and weakness within the text. We evaluate this technique on a test set of concatenated CNN news story transcripts and ...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
Topic segmentation classically relies on one of two criteria, either finding areas with co-herent vo...
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis o...
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis o...
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis o...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
In this paper we compare the performance of three dis-tinct approaches to lexical cohesion based tex...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
We propose a maximum lexical cohesion (MLC) approach to news story segmentation. Unlike sentence-dep...
In this paper we describe an extractive method of creating very short summaries or gists that captur...
In this paper we describe an extractive method of creating very short summaries or gists that captur...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
Topic segmentation classically relies on one of two criteria, either finding areas with co-herent vo...
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis o...
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis o...
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis o...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
In this paper we compare the performance of three dis-tinct approaches to lexical cohesion based tex...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
We propose a maximum lexical cohesion (MLC) approach to news story segmentation. Unlike sentence-dep...
In this paper we describe an extractive method of creating very short summaries or gists that captur...
In this paper we describe an extractive method of creating very short summaries or gists that captur...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
Topic segmentation classically relies on one of two criteria, either finding areas with co-herent vo...