We present a conversational telephone speech data set designed to support research on novel acoustic models. Small vocabulary tasks from 10 words up to 500 words are defined using subsets of the Switchboard-1 corpus; each task has a completely closed vocabulary (an OOV rate of 0%). We justify the need for these tasks, describe the algorithm for selecting them from a large corpus, give a statistical analysis of the data and present baseline whole-word hidden Markov model recognition results. The goal of the paper is to define a common data set and to encourage other researchers to use it
Humans are able to recognise a word before its acoustic realisation is complete. This in contrast to...
Gemmeke J.F., Sehgal S., Cunningham S., ''Fast vocabulary learning for disordered speech vocal inter...
International audienceDistant-microphone automatic speech recognition (ASR) remains a challenging go...
SWITCHBOARD (SWB) Corpus consists of 2438 conversations digitally recorded over long distance teleph...
This paper addresses a critical problem in deploying a spoken dialog system (SDS). One of the main b...
The task of a speech recogniser is to transcribe human speech into text. To do so, modern recogniser...
International audienceReplicability of scientific studies grounded on language corpora requires a ca...
Based on the observation that the unpredictable nature of conversational speech makes it almost impo...
In this letter, we introduce a new pruning strategy for large vocabulary continuous speech recogniti...
Recognition of conversational speech is one of the most challenging speech recognition tasks to-date...
The current “state-of-the-art ” in phonetic speaker recognition uses relative frequencies of phone n...
Summarization: We describe an approach for the estimation of acoustic phonetic models that will be u...
To obtain a robust acoustic model for a certain speech recognition task, a large amount of speech da...
This paper discusses a usage of a mumble model in a Czech telephone dialogue system designed and con...
Automatic speech récognition currently arouses a great interest: it can be considered as a significa...
Humans are able to recognise a word before its acoustic realisation is complete. This in contrast to...
Gemmeke J.F., Sehgal S., Cunningham S., ''Fast vocabulary learning for disordered speech vocal inter...
International audienceDistant-microphone automatic speech recognition (ASR) remains a challenging go...
SWITCHBOARD (SWB) Corpus consists of 2438 conversations digitally recorded over long distance teleph...
This paper addresses a critical problem in deploying a spoken dialog system (SDS). One of the main b...
The task of a speech recogniser is to transcribe human speech into text. To do so, modern recogniser...
International audienceReplicability of scientific studies grounded on language corpora requires a ca...
Based on the observation that the unpredictable nature of conversational speech makes it almost impo...
In this letter, we introduce a new pruning strategy for large vocabulary continuous speech recogniti...
Recognition of conversational speech is one of the most challenging speech recognition tasks to-date...
The current “state-of-the-art ” in phonetic speaker recognition uses relative frequencies of phone n...
Summarization: We describe an approach for the estimation of acoustic phonetic models that will be u...
To obtain a robust acoustic model for a certain speech recognition task, a large amount of speech da...
This paper discusses a usage of a mumble model in a Czech telephone dialogue system designed and con...
Automatic speech récognition currently arouses a great interest: it can be considered as a significa...
Humans are able to recognise a word before its acoustic realisation is complete. This in contrast to...
Gemmeke J.F., Sehgal S., Cunningham S., ''Fast vocabulary learning for disordered speech vocal inter...
International audienceDistant-microphone automatic speech recognition (ASR) remains a challenging go...