International audienceIdentifying topical structure in any text-like data is a challenging task. Most existing techniques rely either on maximizing a measure of the lexical cohesion or on detecting lexical disruptions. A novel method combining the two criteria so as to obtain the best trade-off between cohesion and disruption is proposed in this paper. A new statistical model is defined, based on the work of Isahara and Utiyama (2001), maintaining the properties of domain independence and limited a priori of the latter. Evaluations are performed both on written texts and on automatic transcripts of TV shows, the latter not respecting the norms of written texts, thus increasing the difficulty of the task. Experimental results demonstrate the...
Topic segmentation classically relies on one of two criteria, either finding areas with co-herent vo...
. This paper introduces a new statistical approach to automatically partitioning text into coherent ...
International audienceWe present a method for story segmentation of radio broadcast news, based on l...
International audienceIdentifying topical structure in any text-like data is a challenging task. Mos...
International audienceIdentifying topical structure in any text-like data is a challenging task. Mos...
International audienceIdentifying topical structure in any text-like data is a challenging task. Mos...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
International audienceTopic segmentation methods based on a measure of the lexical cohesion can be a...
International audienceTopic segmentation methods based on a measure of the lexical cohesion can be a...
. This paper introduces a new statistical approach to automatically partitioning text into coherent ...
Topic segmentation classically relies on one of two criteria, either finding areas with co-herent vo...
. This paper introduces a new statistical approach to automatically partitioning text into coherent ...
International audienceWe present a method for story segmentation of radio broadcast news, based on l...
International audienceIdentifying topical structure in any text-like data is a challenging task. Mos...
International audienceIdentifying topical structure in any text-like data is a challenging task. Mos...
International audienceIdentifying topical structure in any text-like data is a challenging task. Mos...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
International audienceTopic segmentation classically relies on one of two criteria, either finding a...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
International audienceTranscript-based topic segmentation of TV programs faces several difficulties ...
International audienceTopic segmentation methods based on a measure of the lexical cohesion can be a...
International audienceTopic segmentation methods based on a measure of the lexical cohesion can be a...
. This paper introduces a new statistical approach to automatically partitioning text into coherent ...
Topic segmentation classically relies on one of two criteria, either finding areas with co-herent vo...
. This paper introduces a new statistical approach to automatically partitioning text into coherent ...
International audienceWe present a method for story segmentation of radio broadcast news, based on l...