This paper describes the extension and optimisation of our previous work on very deep convolutional neural networks (CNNs) for effective recognition of noisy speech in the Aurora 4 task. The appropriate number of convolutional layers, the sizes of the filters, pooling operations and input feature maps are all modified: the filter and pooling sizes are reduced and dimensions of input feature maps are extended to allow adding more convolutional layers. Furthermore appropriate input padding and input feature map selection strategies are developed. In addition, an adaptation framework using joint training of very deep CNN with auxiliary features i-vector and fMLLR features is developed. These modifications give substantial word error rate reduc...
Deep learning is an emerging technology that is considered one of the most promising directions for ...
State-of-the-art automatic speech recognition systems model the relation-ship between acoustic speec...
Recently, automatic speech recognition (ASR) systems that use deep neural networks (DNNs) for acoust...
Recently, convolutional neural networks (CNNs) have been shown to outperform the standard fully conn...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
In many applications, speech recognition must operate in conditions where there are some distances b...
Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been shown to significa...
Deep learning is an emerging technology that is one of the most promising areas of artificial intell...
We proposed an approach to build a robust automatic speech recognizer using deep convolutional neura...
Deep neural networks (DNNs) have been playing a significant role in acoustic modeling. Convolutional...
Representation learning is a fundamental ingredient of deep learning. However, learning a good repre...
This paper proposes two novel frontends for robust lan-guage identification (LID) using a convolutio...
State-of-the-art automatic speech recognition systems model the re-lationship between acoustic speec...
Deep learning-based machine learning models have shown significant results in speech recognition and...
In the last few decades, there has been considerable amount of research on the use of Machine Learni...
Deep learning is an emerging technology that is considered one of the most promising directions for ...
State-of-the-art automatic speech recognition systems model the relation-ship between acoustic speec...
Recently, automatic speech recognition (ASR) systems that use deep neural networks (DNNs) for acoust...
Recently, convolutional neural networks (CNNs) have been shown to outperform the standard fully conn...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
In many applications, speech recognition must operate in conditions where there are some distances b...
Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been shown to significa...
Deep learning is an emerging technology that is one of the most promising areas of artificial intell...
We proposed an approach to build a robust automatic speech recognizer using deep convolutional neura...
Deep neural networks (DNNs) have been playing a significant role in acoustic modeling. Convolutional...
Representation learning is a fundamental ingredient of deep learning. However, learning a good repre...
This paper proposes two novel frontends for robust lan-guage identification (LID) using a convolutio...
State-of-the-art automatic speech recognition systems model the re-lationship between acoustic speec...
Deep learning-based machine learning models have shown significant results in speech recognition and...
In the last few decades, there has been considerable amount of research on the use of Machine Learni...
Deep learning is an emerging technology that is considered one of the most promising directions for ...
State-of-the-art automatic speech recognition systems model the relation-ship between acoustic speec...
Recently, automatic speech recognition (ASR) systems that use deep neural networks (DNNs) for acoust...