Information Retrieval systems determine relevance by comparing information needs with the content of potential retrieval units. Unlike most textual data, automatically generated speech transcripts cannot by default be easily divided into obvious retrieval units due to a lack of explicit structural markers. This problem can be addressed by automatically detecting topically cohesive segments, or stories. However, when the content collection consists of speech from less formal domains than broadcast news, most of the standard automatic boundary detection methods are potentially unsuitable due to their reliance on learned features. In particular for conversational speech, the lack of adequate training data can present a significant issue. In th...
We explore the use of prosodic features beyond pauses, including duration, pitch, and energy feature...
This paper presents experiments on sentence boundary detection in transcripts of spoken dialogues. S...
Story segmentation of news broadcasts has been shown to improve the accuracy of the subsequent proce...
Spoken Document Retrieval (SDR) is usually implemented by using an Information Retrieval (IR) engine...
We propose a maximum lexical cohesion (MLC) approach to news story segmentation. Unlike sentence-dep...
We propose an acoustic TextTiling method based on segmen-tal dynamic time warping for automatic stor...
The recent explosion of available audio-visual media is the new challenge for information retrieval ...
Although speech recognition technology has significantly improved during the past few decades, curre...
The Informedia Digital Library Project [Wactlar96] allows full content indexing and retrieval of tex...
We compare the effect of different text segmentation strategies on speech based passage retrieval of...
This paper investigates the issue of automatic segmentation of speech recordings for broadcast news ...
International audience— In this paper we present an integrated unsupervised method to produce a qual...
This paper presents an approach to identifying sentence boundaries in broadcast speech transcripts. ...
This thesis studies Sentence Unit Detection (SUD) that uses lexical information for Automatic Speech...
In this thesis, research on large vocabulary continuous speech recognition for unknown audio conditi...
We explore the use of prosodic features beyond pauses, including duration, pitch, and energy feature...
This paper presents experiments on sentence boundary detection in transcripts of spoken dialogues. S...
Story segmentation of news broadcasts has been shown to improve the accuracy of the subsequent proce...
Spoken Document Retrieval (SDR) is usually implemented by using an Information Retrieval (IR) engine...
We propose a maximum lexical cohesion (MLC) approach to news story segmentation. Unlike sentence-dep...
We propose an acoustic TextTiling method based on segmen-tal dynamic time warping for automatic stor...
The recent explosion of available audio-visual media is the new challenge for information retrieval ...
Although speech recognition technology has significantly improved during the past few decades, curre...
The Informedia Digital Library Project [Wactlar96] allows full content indexing and retrieval of tex...
We compare the effect of different text segmentation strategies on speech based passage retrieval of...
This paper investigates the issue of automatic segmentation of speech recordings for broadcast news ...
International audience— In this paper we present an integrated unsupervised method to produce a qual...
This paper presents an approach to identifying sentence boundaries in broadcast speech transcripts. ...
This thesis studies Sentence Unit Detection (SUD) that uses lexical information for Automatic Speech...
In this thesis, research on large vocabulary continuous speech recognition for unknown audio conditi...
We explore the use of prosodic features beyond pauses, including duration, pitch, and energy feature...
This paper presents experiments on sentence boundary detection in transcripts of spoken dialogues. S...
Story segmentation of news broadcasts has been shown to improve the accuracy of the subsequent proce...