In this work, we propose a modular combination of two pop-ular applications of neural networks to large-vocabulary con-tinuous speech recognition. First, a deep neural network is trained to extract bottleneck features from frames of mel scale filterbank coefficients. In a similar way as is usually done for GMM/HMM systems, this network is then applied as a non-linear discriminative feature-space transformation for a hybrid setup where acoustic modeling is performed by a deep belief network. This effectively results in a very large network, where the layers of the bottleneck network are fixed and applied to suc-cessive windows of feature frames in a time-delay fashion. We show that bottleneck features improve the recognition perfor-mance of ...
Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are the state-of-the-art for acoustic modelin...
To improve speech recognition performance, a combination between TANDEM and bottleneck Deep Neural N...
Deep Bidirectional LSTM (DBLSTM) recurrent neural net-works have recently been shown to give state-o...
Conventional speech recognition systems consist of feature extraction, acoustic and language modelin...
We have recently proposed a new acoustic model based on prob-abilistic linear discriminant analysis ...
This paper describes a Deep Belief Neural Network (DBNN) and Bidirectional Long-Short Term Memory (L...
We have recently proposed a new acoustic model based on prob-abilistic linear discriminant analysis ...
In this work, we propose several deep neural network architectures that are able to leverage data fr...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In the recent years, Deep Neural Network-Hidden Markov Model (DNN-HMM) systems have overtaken the tr...
<p>In this work, we propose several deep neural network architectures that are able to leverage data...
Deep neural networks (DNNs) are now a central component of nearly all state-of-the-art speech recogn...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
Hybrid systems which integrate the deep neural network (DNN) and hidden Markov model (HMM) have rece...
Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been shown to significa...
Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are the state-of-the-art for acoustic modelin...
To improve speech recognition performance, a combination between TANDEM and bottleneck Deep Neural N...
Deep Bidirectional LSTM (DBLSTM) recurrent neural net-works have recently been shown to give state-o...
Conventional speech recognition systems consist of feature extraction, acoustic and language modelin...
We have recently proposed a new acoustic model based on prob-abilistic linear discriminant analysis ...
This paper describes a Deep Belief Neural Network (DBNN) and Bidirectional Long-Short Term Memory (L...
We have recently proposed a new acoustic model based on prob-abilistic linear discriminant analysis ...
In this work, we propose several deep neural network architectures that are able to leverage data fr...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
In the recent years, Deep Neural Network-Hidden Markov Model (DNN-HMM) systems have overtaken the tr...
<p>In this work, we propose several deep neural network architectures that are able to leverage data...
Deep neural networks (DNNs) are now a central component of nearly all state-of-the-art speech recogn...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
Hybrid systems which integrate the deep neural network (DNN) and hidden Markov model (HMM) have rece...
Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been shown to significa...
Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are the state-of-the-art for acoustic modelin...
To improve speech recognition performance, a combination between TANDEM and bottleneck Deep Neural N...
Deep Bidirectional LSTM (DBLSTM) recurrent neural net-works have recently been shown to give state-o...