A method for speaker normalization in deep neural network (DNN) based discriminative feature estimation for automatic speech recog-nition (ASR) is presented. This method is applied in the context of a DNN configured for auto-encoder based low dimensional bot-tleneck (AE-BN) feature extraction where the derived features are used as input to a continuous Gaussian density hidden Markov model (HMM/GMM) based ASR decoder. While AE-BN features are known to provide significant reduction in ASR word error rate (WER) with respect to more conventional spectral magnitude based features, there is no general agreement on how these networks can reduce the impact of speaker variability by incorporating prior knowledge of the speaker. An approach is presen...
This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) p...
In the paper we present two techniques improving the recognition accuracy of multilayer perceptron n...
This paper investigates the use of parameterised sigmoid and rectified linear unit (ReLU) hidden act...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
Deep learning and neural network research has grown significantly in the fields of automatic speech ...
Recently, automatic speech recognition (ASR) systems that use deep neural networks (DNNs) for acoust...
We investigate the concept of speaker adaptive training (SAT) in the context of deep neural network ...
In this work, the speaker normalisation problem is afforded by two different techniques. The first o...
Deep Neural Network (DNN) has become a standard method in many ASR tasks. Recently there is consider...
Automatic speech recognition (ASR) incorporates knowledge and research in linguistics, computer scie...
In spite of recent advances in automatic speech recognition, the performance of state-of-the-art spe...
<p>We investigate the concept of speaker adaptive training (SAT) in the context of deep neural netwo...
This paper introduces deep neural network (DNN)–hidden Markov model (HMM)-based methods to tackle sp...
This paper introduces approaches based on vocal tract length normalisation (VTLN) techniques for hyb...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) p...
In the paper we present two techniques improving the recognition accuracy of multilayer perceptron n...
This paper investigates the use of parameterised sigmoid and rectified linear unit (ReLU) hidden act...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
Deep learning and neural network research has grown significantly in the fields of automatic speech ...
Recently, automatic speech recognition (ASR) systems that use deep neural networks (DNNs) for acoust...
We investigate the concept of speaker adaptive training (SAT) in the context of deep neural network ...
In this work, the speaker normalisation problem is afforded by two different techniques. The first o...
Deep Neural Network (DNN) has become a standard method in many ASR tasks. Recently there is consider...
Automatic speech recognition (ASR) incorporates knowledge and research in linguistics, computer scie...
In spite of recent advances in automatic speech recognition, the performance of state-of-the-art spe...
<p>We investigate the concept of speaker adaptive training (SAT) in the context of deep neural netwo...
This paper introduces deep neural network (DNN)–hidden Markov model (HMM)-based methods to tackle sp...
This paper introduces approaches based on vocal tract length normalisation (VTLN) techniques for hyb...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) p...
In the paper we present two techniques improving the recognition accuracy of multilayer perceptron n...
This paper investigates the use of parameterised sigmoid and rectified linear unit (ReLU) hidden act...