Reservoir Computing (RC) has recently been introduced as an interesting alternative for acoustic modeling. For phone and continuous digit recognition, the reservoir approach obtained quite promising results. In this work, we further elaborate this concept by porting some well-known techniques used to enhance recognition rates of GMM-based models to Reservoir Computing. In particular, we introduce context-dependent (CD) triphone states to model co-articulation and pronunciation mismatches arising from an imperfect lexicon. We also propose to incorporate two speaker normalization methods in the feature space, namely mean \& variance normalization and vocal tract length normalization. The impact of the investigated techniques is studied in the...
This paper describes continuous speech recognition incorporating the additional complement informati...
Speech recognition applications are known to require a significant amount of resources (training dat...
It has been shown in several recent publications that application of vocal tract normalization (VTN)...
Reservoir Computing (RC) has recently been introduced as an interesting alternative for acoustic mod...
Accurate acoustic modeling is an essential requirement of a state-of-the-art continuous speech recog...
Thanks to recent research in neural network based acoustic modeling, Large Vocabulary Continuous Spe...
In this paper we propose an efficient method to utilize context in the output densities of HMMs. Sta...
Automatic speech recognition has gradually improved over the years, but the reliable recognition of ...
In earlier work we have shown that good phoneme recognition is possible with a so-called reservoir, ...
This thesis addresses the problem of speech phone recognition. Phones are the acoustic sounds of spe...
For most speech recognition systems dynamic features are the only way to incorporate temporal contex...
A crucial issue in triphone based continuous speech recogni-tion is the large number of models to be...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
In this paper a formerly proposed continuous digit recognition system based on Reservoir Computing (...
Recent work in phonetic speaker recognition has shown that modeling phone sequences using n-grams is...
This paper describes continuous speech recognition incorporating the additional complement informati...
Speech recognition applications are known to require a significant amount of resources (training dat...
It has been shown in several recent publications that application of vocal tract normalization (VTN)...
Reservoir Computing (RC) has recently been introduced as an interesting alternative for acoustic mod...
Accurate acoustic modeling is an essential requirement of a state-of-the-art continuous speech recog...
Thanks to recent research in neural network based acoustic modeling, Large Vocabulary Continuous Spe...
In this paper we propose an efficient method to utilize context in the output densities of HMMs. Sta...
Automatic speech recognition has gradually improved over the years, but the reliable recognition of ...
In earlier work we have shown that good phoneme recognition is possible with a so-called reservoir, ...
This thesis addresses the problem of speech phone recognition. Phones are the acoustic sounds of spe...
For most speech recognition systems dynamic features are the only way to incorporate temporal contex...
A crucial issue in triphone based continuous speech recogni-tion is the large number of models to be...
The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling ...
In this paper a formerly proposed continuous digit recognition system based on Reservoir Computing (...
Recent work in phonetic speaker recognition has shown that modeling phone sequences using n-grams is...
This paper describes continuous speech recognition incorporating the additional complement informati...
Speech recognition applications are known to require a significant amount of resources (training dat...
It has been shown in several recent publications that application of vocal tract normalization (VTN)...