The recent explosion of available audio-visual media is the new challenge for information retrieval research. Audio speech recognition systems translate spoken content to the text domain. There is a need for searching and indexing this data which possesses no logical structure. One possible way to structure it on a high level of abstraction is by finding topic boundaries. Two unsupervised topic segmentation methods were evaluated with real-world data in the course of this work. The first one, TSF, models topic shifts as fluctuations in the similarity function of the transcript. The second one, LCSeg, approaches topic changes as places with the least overlapping lexical chains. Only LCSeg performed close to a similar real-world corpus. Other...
In order to solve problems of reliability of systems based on lexical repetition and problems of ada...
We present a probabilistic model that uses both prosodic and lexical cues for the automatic segmenta...
Neural sentence encoders (NSE) are effective in many NLP tasks, including topic segmentation. Howeve...
The recent explosion of available audio-visual media is the new challenge for information retrieval ...
Most documents are about more than one subject, but the majority of natural language processing algo...
. We investigate the problem of text segmentation by topic. Applications for this task include topic...
International audienceTopic segmentation methods are mostly based on the idea of lexical cohesion, i...
The growth in the collections of multimedia documents made the development of new data access and da...
Topic segmentation classically relies on one of two criteria, either finding areas with co-herent vo...
We address the task of unsupervised topic segmentation of speech data operating over raw acoustic in...
In this article we address the task of automatic text structuring into linear and non-overlapping th...
Most documents are aboutmore than one subject, but the majority of natural language processing algor...
In this research, topic segmentation in texts (a.k.a. text segmentation) is used as a proxy for top...
International audienceSeveral evaluation metrics have been proposed for topic seg-mentation. Most of...
Information Retrieval systems determine relevance by comparing information needs with the content of...
In order to solve problems of reliability of systems based on lexical repetition and problems of ada...
We present a probabilistic model that uses both prosodic and lexical cues for the automatic segmenta...
Neural sentence encoders (NSE) are effective in many NLP tasks, including topic segmentation. Howeve...
The recent explosion of available audio-visual media is the new challenge for information retrieval ...
Most documents are about more than one subject, but the majority of natural language processing algo...
. We investigate the problem of text segmentation by topic. Applications for this task include topic...
International audienceTopic segmentation methods are mostly based on the idea of lexical cohesion, i...
The growth in the collections of multimedia documents made the development of new data access and da...
Topic segmentation classically relies on one of two criteria, either finding areas with co-herent vo...
We address the task of unsupervised topic segmentation of speech data operating over raw acoustic in...
In this article we address the task of automatic text structuring into linear and non-overlapping th...
Most documents are aboutmore than one subject, but the majority of natural language processing algor...
In this research, topic segmentation in texts (a.k.a. text segmentation) is used as a proxy for top...
International audienceSeveral evaluation metrics have been proposed for topic seg-mentation. Most of...
Information Retrieval systems determine relevance by comparing information needs with the content of...
In order to solve problems of reliability of systems based on lexical repetition and problems of ada...
We present a probabilistic model that uses both prosodic and lexical cues for the automatic segmenta...
Neural sentence encoders (NSE) are effective in many NLP tasks, including topic segmentation. Howeve...