International audienceTopic segmentation classically relies on one of two criteria, either finding areas with coherent vocabulary use or detecting discontinuities. In this paper, we propose a segmentation criterion combining both lexical cohesion and disruption, enabling a trade-off between the two. We provide the mathematical formulation of the criterion and an efficient graph based decoding algorithm for topic segmentation. Experimental results on standard textual data sets and on a more challenging corpus of automatically transcribed broadcast news shows demonstrate the benefit of such a combination. Gains were observed in all conditions, with segments of either regular or varying length and abrupt or smooth topic shifts. Long segments b...
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis o...
In this paper we propose a course-grained NLP approach to text segmentation based on the\ud analysis...
We introduce a novel topic segmentation approach that combines evidence of topic shifts from lexical...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
Topic segmentation classically relies on one of two criteria, either finding areas with co-herent vo...
International audienceTopic segmentation traditionally relies on lexical cohesion measured through w...
International audienceTopic segmentation traditionally relies on lexical cohesion measured through w...
Most documents are about more than one subject, but the majority of natural language processing algo...
Most documents are about more than one subject, but the majority of natural language processing algo...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
. We investigate the problem of text segmentation by topic. Applications for this task include topic...
International audienceSeveral evaluation metrics have been proposed for topic seg-mentation. Most of...
We propose a maximum lexical cohesion (MLC) approach to news story segmentation. Unlike sentence-dep...
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis o...
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis o...
In this paper we propose a course-grained NLP approach to text segmentation based on the\ud analysis...
We introduce a novel topic segmentation approach that combines evidence of topic shifts from lexical...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
Topic segmentation classically relies on one of two criteria, either finding areas with co-herent vo...
International audienceTopic segmentation traditionally relies on lexical cohesion measured through w...
International audienceTopic segmentation traditionally relies on lexical cohesion measured through w...
Most documents are about more than one subject, but the majority of natural language processing algo...
Most documents are about more than one subject, but the majority of natural language processing algo...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
. We investigate the problem of text segmentation by topic. Applications for this task include topic...
International audienceSeveral evaluation metrics have been proposed for topic seg-mentation. Most of...
We propose a maximum lexical cohesion (MLC) approach to news story segmentation. Unlike sentence-dep...
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis o...
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis o...
In this paper we propose a course-grained NLP approach to text segmentation based on the\ud analysis...
We introduce a novel topic segmentation approach that combines evidence of topic shifts from lexical...