This paper describes a system that extracts principal content words from speech-recognized transcripts of voicemail messages and classifies them as proper names, telephone numbers, dates/times, or 'other'. The resulting short text summaries are suitable for mobile messaging applications. The system uses a set of classifiers to identify the summary words, with each word described by a vector of lexical and prosodic features. The features are selected using Parcel, an ROC-based algorithm. We visually compare the roles of a large number of individual features and discuss effective ways to combine them. Finally, we evaluate their performance on manual transcriptions and on automatic transcriptions derived from two different speech recognition systems.
Several approaches to automatic speech summarization are discussed below, using the ICSI Meetings co...
Dynamic Time warping, Multimedia indexing, speech processing. The automatic summ...
Speech summarization techniques take human speech as input and then output an abridged version as te...
This paper is about the evaluation of a system that generates short text summaries of voicemail mess...
This article presents trainable methods for extracting principal content words from voicemail messag...
This paper presents a novel data-driven approach to summarizing spoken audio transcripts utilizing l...
This paper is about the evaluation of a system that generates short text summaries of voicemail mess...
This paper describes the development of a system to transcribe and summarize voicemail messages. The...
When a speaker leaves a voicemail message there are prosodic cues that emphasize the important point...
This paper describes an alternative architecture for voicemail data retrieval on the move. It is com...
This paper considers extractive summarization of Chinese spoken documents. In contrast to convention...
The task of extractive speech summarization is to select a set of salient sentences from an...
Abstract summarization of conversations is a very challenging task that requires full understanding ...
This paper shows that linguistic techniques along with machine learning can extract high quality nou...
We present results of an empirical study of the usefulness of different types of features in selecti...