Story segmentation of news broadcasts has been shown to improve the accuracy of subsequent processes such as question answering and information retrieval. In previous work, a decision tree was trained on automatically extracted lexical and acoustic features to predict story boundaries, using hypothesized sentence boundaries to define potential story boundaries. In this paper, we empirically evaluate several alternatives for the choice of segmentation on three languages: English, Mandarin, and Arabic. Our results suggest that the best performance can be achieved by using 250 ms pause-based segmentation or sentence boundaries determined using a very low confidence score threshold.
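As an illustration of the pause-based segmentation mentioned above, the following is a minimal sketch and not the system evaluated in the paper. It assumes that recognizer output is available as (token, start, end) tuples with times in seconds; the names `pause_segments` and `candidate_boundaries` are hypothetical. Words are grouped into segments wherever the silence between consecutive words is at least 250 ms, and the end of every segment except the last is treated as a potential story boundary that a classifier could then accept or reject.

```python
from typing import List, Tuple

# Assumed input format: (token, start_sec, end_sec) for each recognized word.
Word = Tuple[str, float, float]


def pause_segments(words: List[Word], min_pause: float = 0.250) -> List[List[Word]]:
    """Split a time-ordered word sequence wherever the inter-word pause is >= min_pause."""
    segments: List[List[Word]] = []
    current: List[Word] = []
    for word in words:
        # Gap between the end of the previous word and the start of this one.
        if current and word[1] - current[-1][2] >= min_pause:
            segments.append(current)
            current = []
        current.append(word)
    if current:
        segments.append(current)
    return segments


def candidate_boundaries(segments: List[List[Word]]) -> List[float]:
    """Return the end time of every segment except the last as a potential story boundary."""
    return [segment[-1][2] for segment in segments[:-1]]


# Toy timings, invented for illustration only.
words = [
    ("good", 0.00, 0.20), ("evening", 0.22, 0.60),
    ("in", 1.00, 1.10), ("other", 1.12, 1.40), ("news", 1.41, 1.80),
    ("thanks", 2.10, 2.40),
]
segments = pause_segments(words)
boundaries = candidate_boundaries(segments)
```

Raising `min_pause` yields fewer, longer segments and therefore fewer candidate boundaries; lowering it has the opposite effect.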
We propose an acoustic TextTiling method based on segmental dynamic time warping for automatic stor...
The Informedia Digital Library Project [Wactlar96] allows full content indexing and retrieval of tex...
Information Retrieval systems determine relevance by comparing information needs with the content of...
In this paper, we present results from a Broadcast News story segmentation system developed for the ...
Text segmentation is a critical step in many applications, and while it has been addressed exten...
In this paper, we explore the use of prosodic features in sentence boundary detection in Chinese br...
We propose a maximum lexical cohesion (MLC) approach to news story segmentation. Unlike sentence-dep...
In this work we aim at enriching the transcript of an automatic speech recognition system with punct...
This paper presents an approach to identifying sentence boundaries in broadcast speech transcripts. ...
Audio-visual content analysis is an area that is receiving increased interest, especially with the a...
In this paper, we propose integration of multimodal features using conditional random fields (CRFs) ...
Traditional unsupervised broadcast news story segmentation approaches have to set the segmentation ...
We explore the use of prosodic features beyond pauses, including duration, pitch, and energy feature...
In this paper, a novel automatic news story segmentation scheme based on audio-visual features and te...