We report the results of a study on topic spotting in conversational speech. Using a machine learning approach, we build classifiers that accept an audio file of conversational human speech as input, and output an estimate of the topic being discussed. Our methodology makes use of a wellknown corpus of transcribed and topic-labeled speech (the Switchboard corpus), and involves an interesting double use of the BOOSTEXTER learning algorithm. Our work is distinguished from previous efforts in topic spotting by our explicit study of the effects of dialogue length on classifier performance, and by our use of off-theshelf speech recognition technology. One of our main results is the identification of a single classifier with good performance (rel...
We describe a language independent word burst feature based on the structure of conversational speec...
We present a probabilistic model that uses both prosodic and lexical cues for the automatic segmenta...
In many topic identification applications, supervised training labels are indirectly related to the ...
We report the results of a study on topic spotting in conversational speech. Using a machine learnin...
In this paper we present a new approach for topic spotting based on subword units (phonemes and feat...
Topic detection in dialogue datasets has become a significant challenge for unsupervised and unlabel...
In this paper we present a new approach for topic spotting based on subword units and feature vector...
This thesis is about topic detection from spoken speech. The first part of the thesis deals with spe...
In this paper we present a new approach for topic spotting based on subword units and fea-ture vecto...
We address the task of unsupervised topic segmentation of speech data operating over raw acoustic in...
One of the main challenges in discourse analysis is the process of segmenting text into meaningful ...
We study the problem of topic seg-mentation of manually transcribed speech in order to facilitate in...
This is an introductory tutorial paper for the Special Session on Machine Learning in Spoken Dialogu...
<p>We describe a language independent word burst feature based on the structure of conversational sp...
Colloque avec actes et comité de lecture. nationale.National audienceThis paper presents several met...
We describe a language independent word burst feature based on the structure of conversational speec...
We present a probabilistic model that uses both prosodic and lexical cues for the automatic segmenta...
In many topic identification applications, supervised training labels are indirectly related to the ...
We report the results of a study on topic spotting in conversational speech. Using a machine learnin...
In this paper we present a new approach for topic spotting based on subword units (phonemes and feat...
Topic detection in dialogue datasets has become a significant challenge for unsupervised and unlabel...
In this paper we present a new approach for topic spotting based on subword units and feature vector...
This thesis is about topic detection from spoken speech. The first part of the thesis deals with spe...
In this paper we present a new approach for topic spotting based on subword units and fea-ture vecto...
We address the task of unsupervised topic segmentation of speech data operating over raw acoustic in...
One of the main challenges in discourse analysis is the process of segmenting text into meaningful ...
We study the problem of topic seg-mentation of manually transcribed speech in order to facilitate in...
This is an introductory tutorial paper for the Special Session on Machine Learning in Spoken Dialogu...
<p>We describe a language independent word burst feature based on the structure of conversational sp...
Colloque avec actes et comité de lecture. nationale.National audienceThis paper presents several met...
We describe a language independent word burst feature based on the structure of conversational speec...
We present a probabilistic model that uses both prosodic and lexical cues for the automatic segmenta...
In many topic identification applications, supervised training labels are indirectly related to the ...