This paper proposes an algorithm to design a tied-state inventory for a context dependent, neural network-based acoustic model for speech recognition. Rather than relying on a GMM/HMM system that operates on a different feature space and is of a different model family, the proposed algorithm optimizes state tying on the activa-tion vectors of the neural network directly. Experiments show the viability of the proposed algorithm reducing the WER from 36.3% for a context independent system to 16.0 % for a 15000 tied-state system
In an HMM based large vocabulary continuous speech recognition system, the evaluation of - context d...
Conventional speech recognition systems consist of feature extraction, acoustic and language modelin...
In this paper, we propose a rank‐weighted reconstruction feature to improve the robustness of a feed...
Recently, context-dependent (CD) deep neural network (DNN) hidden Markov models (HMMs) have been wid...
We propose an algorithm that allows online training of a con-text dependent DNN model. It designs a ...
In previous work we have introduced a multi-task training tech-nique for neural network acoustic mod...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The use of context-dependent targets has become standard in hybrid DNN systems for automatic speech ...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
Accurate acoustic modeling is an essential requirement of a state-of-the-art continuous speech recog...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
We investigate the use of large state inventories and the soft-plus nonlinearity for on-device neura...
Speech recognition has been an important sector of research to enhance the user interaction with mac...
Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are the state-of-the-art for acoustic modelin...
In this work, we propose a modular combination of two pop-ular applications of neural networks to la...
In an HMM based large vocabulary continuous speech recognition system, the evaluation of - context d...
Conventional speech recognition systems consist of feature extraction, acoustic and language modelin...
In this paper, we propose a rank‐weighted reconstruction feature to improve the robustness of a feed...
Recently, context-dependent (CD) deep neural network (DNN) hidden Markov models (HMMs) have been wid...
We propose an algorithm that allows online training of a con-text dependent DNN model. It designs a ...
In previous work we have introduced a multi-task training tech-nique for neural network acoustic mod...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
The use of context-dependent targets has become standard in hybrid DNN systems for automatic speech ...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
Accurate acoustic modeling is an essential requirement of a state-of-the-art continuous speech recog...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
We investigate the use of large state inventories and the soft-plus nonlinearity for on-device neura...
Speech recognition has been an important sector of research to enhance the user interaction with mac...
Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are the state-of-the-art for acoustic modelin...
In this work, we propose a modular combination of two pop-ular applications of neural networks to la...
In an HMM based large vocabulary continuous speech recognition system, the evaluation of - context d...
Conventional speech recognition systems consist of feature extraction, acoustic and language modelin...
In this paper, we propose a rank‐weighted reconstruction feature to improve the robustness of a feed...