This report presents one month trainee work on development of French Automatic Speech Recogni-tion (ASR) system using a french part of multilingual database GlobalPhone FR. The purpose of this report is to explain and give results of the training and testing of the ASR with this specific database. Two different methods are presented, the Hidden Markov Model (HMM) with MFCC/PLP features and tandem features from Multilayer Perceptron (MLP) phone posteriors. The report presents data prepara-tion for GlobalPhone FR ASR training, and compares the two different approaches. Word recognition accuracy achieved with MFCC features is 71.46 % and the tandem features with 3-layer MLP improved the accuracy to 72.15%. We interpret this result as a baselin...
Automatic speech recognition (ASR) technology has achieved a level of maturity, where it is already ...
This paper presents the results of the NEOLOGOS project: a children database and an optimized adult ...
A major research activity at LIMSI is multilingual, speakerindependent, large vocabulary speech dict...
In this paper we report a series of tests carried out on our hybrid HMM/ANN systems which aims at co...
This paper describes the design, collection, and current status of the multilingual database GlobalP...
Standard automatic speech recognition (ASR) systems use phonemes as subword units. Thus, one of the ...
Standard hidden Markov model (HMM) based automatic speech recogni-tion (ASR) systems use phonemes as...
A series of experiments on speaker-independent phone recognition of continuous speech have been carr...
In this paper, we present several methods for mapping recognition engine requirements to mobile phon...
International audienceIn this paper, hidden Markov models (HMM)-based vowel and consonant automatic ...
International audienceThis paper presents CLIPS laboratory activities in speech recognition related ...
Abstract. Automatic speech recognition in database-lacking languages like Romanian must imply new sy...
A renewed focus on foreign language (FL) learning and speech for communication has resulted in compu...
This paper describes the design, collection, and current status of the multilingual database GlobalP...
In recent years, the features derived from posteriors of a multilayer perceptron (MLP), known as tan...
Automatic speech recognition (ASR) technology has achieved a level of maturity, where it is already ...
This paper presents the results of the NEOLOGOS project: a children database and an optimized adult ...
A major research activity at LIMSI is multilingual, speakerindependent, large vocabulary speech dict...
In this paper we report a series of tests carried out on our hybrid HMM/ANN systems which aims at co...
This paper describes the design, collection, and current status of the multilingual database GlobalP...
Standard automatic speech recognition (ASR) systems use phonemes as subword units. Thus, one of the ...
Standard hidden Markov model (HMM) based automatic speech recogni-tion (ASR) systems use phonemes as...
A series of experiments on speaker-independent phone recognition of continuous speech have been carr...
In this paper, we present several methods for mapping recognition engine requirements to mobile phon...
International audienceIn this paper, hidden Markov models (HMM)-based vowel and consonant automatic ...
International audienceThis paper presents CLIPS laboratory activities in speech recognition related ...
Abstract. Automatic speech recognition in database-lacking languages like Romanian must imply new sy...
A renewed focus on foreign language (FL) learning and speech for communication has resulted in compu...
This paper describes the design, collection, and current status of the multilingual database GlobalP...
In recent years, the features derived from posteriors of a multilayer perceptron (MLP), known as tan...
Automatic speech recognition (ASR) technology has achieved a level of maturity, where it is already ...
This paper presents the results of the NEOLOGOS project: a children database and an optimized adult ...
A major research activity at LIMSI is multilingual, speakerindependent, large vocabulary speech dict...