We investigate two strategies to improve the context-dependent deep neural network hidden Markov model (CD-DNN-HMM) in low-resource speech recognition. Although outperforming the conventional Gaussian mixture model (GMM) HMM on various tasks, CD-DNN-HMM acoustic modeling becomes challenging with limited transcribed speech, e.g., less than 10 hours. To resolve this issue, we firstly exploit dropout which prevents overfitting in DNN finetuning and improves model robustness under data sparseness. Then, the effectiveness of multilingual DNN training is evaluated when additional auxiliary languages are available. The hidden layer parameters of the target language are shared and learned over multiple languages. Experiments show that both strategi...
In this work, we propose several deep neural network architectures that are able to leverage data fr...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Training speech recognizer with under-resourced language data still proves difficult. Indonesian lan...
We investigate two strategies to improve the context-dependent deep neural network hidden Markov mod...
We propose a new technique for training deep neural networks (DNNs) as data-driven feature front-end...
Recently, context-dependent (CD) deep neural network (DNN) hidden Markov models (HMMs) have been wid...
Abstract—Recently, context-dependent deep neural network hidden Markov models (CD-DNN-HMMs) have bee...
We investigate multilingual modeling in the context of a deep neural network (DNN) – hidden Markov ...
Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are the state-of-the-art for acoustic modelin...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
We describe a novel way to implement subword language models in speech recognition systems based on ...
AbstractMultilingual Deep Neural Networks (DNNs) have been successfully used to leverage out-of-lang...
Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been shown to significa...
For resource rich languages, recent works have shown Neural Network based Language Models (NNLMs) to...
© 2016 The Authors. Multilingual Deep Neural Networks (DNNs) have been successfully used to leverage...
In this work, we propose several deep neural network architectures that are able to leverage data fr...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Training speech recognizer with under-resourced language data still proves difficult. Indonesian lan...
We investigate two strategies to improve the context-dependent deep neural network hidden Markov mod...
We propose a new technique for training deep neural networks (DNNs) as data-driven feature front-end...
Recently, context-dependent (CD) deep neural network (DNN) hidden Markov models (HMMs) have been wid...
Abstract—Recently, context-dependent deep neural network hidden Markov models (CD-DNN-HMMs) have bee...
We investigate multilingual modeling in the context of a deep neural network (DNN) – hidden Markov ...
Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are the state-of-the-art for acoustic modelin...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
We describe a novel way to implement subword language models in speech recognition systems based on ...
AbstractMultilingual Deep Neural Networks (DNNs) have been successfully used to leverage out-of-lang...
Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been shown to significa...
For resource rich languages, recent works have shown Neural Network based Language Models (NNLMs) to...
© 2016 The Authors. Multilingual Deep Neural Networks (DNNs) have been successfully used to leverage...
In this work, we propose several deep neural network architectures that are able to leverage data fr...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Training speech recognizer with under-resourced language data still proves difficult. Indonesian lan...