A series of theoretical and experimental results have suggested that multilayer perceptrons (MLPs) are an effective family of algorithms for the smooth estimate of highly dimensioned probability density functions that are useful in continuous speech recognition. All of these systems have exclusively used context-independent phonetic models, in the sense that the probabilities or costs are estimated for simple speech units such as phonemes or words, rather than biphones or triphones. Numerous conventional systems based on hidden Markov models (HMMs) have been reported that use triphone or triphone like context-dependent models. In one case the outputs of many context-dependent MLPs (one per context class) were used to help choose the best se...
The use of context-dependent targets has become standard in hybrid DNN systems for automatic speech ...
Abstract—Recently, context-dependent deep neural network hidden Markov models (CD-DNN-HMMs) have bee...
Recently neural networks have been used successfully for real-time large vocabulary speech recogniti...
We present he concept of a "Segmental Neural Net " (SNN) for phonetic modeling in continuo...
Previously, we had developed the concept of a Segmental Neural Net (SNN) for phonetic modeling in co...
Recently, context-dependent (CD) deep neural network (DNN) hidden Markov models (HMMs) have been wid...
In this work we do a comparative evaluation between Artificial Neural Networks (RNA's) and Continuou...
In this paper, we propose a neural network based model of robust speech recognition by integrating a...
This paper presents a new hybrid system for speaker independent continuous speech recognition in a l...
A set of recurrent artificial neural networks are used for speech recognition. By representing speec...
This paper describes continuous speech recognition incorporating the additional complement informati...
A summary of the theory of the hybrid connectionist HMM (hidden Markov model) continuous speech reco...
In recent years, researchers have established the viability of so called hybrid NN/HMM large vocabul...
In this paper, hybrid HMM/ANN systems are used to model context dependent phones. In order to reduce...
Conventional statistical parametric speech synthesis relies on decision trees to cluster together si...
The use of context-dependent targets has become standard in hybrid DNN systems for automatic speech ...
Abstract—Recently, context-dependent deep neural network hidden Markov models (CD-DNN-HMMs) have bee...
Recently neural networks have been used successfully for real-time large vocabulary speech recogniti...
We present he concept of a "Segmental Neural Net " (SNN) for phonetic modeling in continuo...
Previously, we had developed the concept of a Segmental Neural Net (SNN) for phonetic modeling in co...
Recently, context-dependent (CD) deep neural network (DNN) hidden Markov models (HMMs) have been wid...
In this work we do a comparative evaluation between Artificial Neural Networks (RNA's) and Continuou...
In this paper, we propose a neural network based model of robust speech recognition by integrating a...
This paper presents a new hybrid system for speaker independent continuous speech recognition in a l...
A set of recurrent artificial neural networks are used for speech recognition. By representing speec...
This paper describes continuous speech recognition incorporating the additional complement informati...
A summary of the theory of the hybrid connectionist HMM (hidden Markov model) continuous speech reco...
In recent years, researchers have established the viability of so called hybrid NN/HMM large vocabul...
In this paper, hybrid HMM/ANN systems are used to model context dependent phones. In order to reduce...
Conventional statistical parametric speech synthesis relies on decision trees to cluster together si...
The use of context-dependent targets has become standard in hybrid DNN systems for automatic speech ...
Abstract—Recently, context-dependent deep neural network hidden Markov models (CD-DNN-HMMs) have bee...
Recently neural networks have been used successfully for real-time large vocabulary speech recogniti...