Recently, the deep neural networks (DNNs) based acoustic modeling methods have been successfully applied to many speech recognition tasks. This paper reports the work about applying DNNs for syllable based acoustic modeling in Chinese automatic speech recognition (ASR). Compared with initial/finals (IFs), syllable can implicitly model the intra-syllable variations in better accuracy. However, the context dependent syllable based modeling set holds too many units, bringing about heavy problems on modeling and decoding implementation. In this paper, a WFST decoding framework is applied. Moreover, the decision tree based state tying and DNNs based models are discussed for the acoustic model training. The experimental results show that compared...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
To make full use of a small development data set to build a robust dialectal Chinese speech recogniz...
Abstract—In acoustic modeling, speaker adaptive training (SAT) has been a long-standing technique fo...
Abstract—Recently, the deep neural networks (DNNs) based acoustic modeling methods have been success...
This paper compared the performance of different acoustic modeling units in deep neural networks (DN...
This paper compared the performance of different acoustic modeling units in deep neural networks (DN...
The choice of basic modeling unit in building acoustic model for a continuous Mandarin speech recogn...
The choice of basic modeling unit in building acoustic model for a continuous Mandarin speech recogn...
A novel acoustic modeling method for Chinese speech recognition based on Intra-Syllable Dependent Ph...
This paper describes the new framework of context-dependent (CD) Initial/Final (IF) acoustic modelin...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
This paper investigates the use of Multi-Distribution Deep Neu-ral Networks (MD-DNNs) for integratin...
This paper presents a distinctive phonetic features (DPFs) based phoneme recognition method by incor...
Abstract—Recently, context-dependent deep neural network hidden Markov models (CD-DNN-HMMs) have bee...
Recently, deep neural networks (DNNs) have outperformed traditional acoustic models on a variety of ...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
To make full use of a small development data set to build a robust dialectal Chinese speech recogniz...
Abstract—In acoustic modeling, speaker adaptive training (SAT) has been a long-standing technique fo...
Abstract—Recently, the deep neural networks (DNNs) based acoustic modeling methods have been success...
This paper compared the performance of different acoustic modeling units in deep neural networks (DN...
This paper compared the performance of different acoustic modeling units in deep neural networks (DN...
The choice of basic modeling unit in building acoustic model for a continuous Mandarin speech recogn...
The choice of basic modeling unit in building acoustic model for a continuous Mandarin speech recogn...
A novel acoustic modeling method for Chinese speech recognition based on Intra-Syllable Dependent Ph...
This paper describes the new framework of context-dependent (CD) Initial/Final (IF) acoustic modelin...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
This paper investigates the use of Multi-Distribution Deep Neu-ral Networks (MD-DNNs) for integratin...
This paper presents a distinctive phonetic features (DPFs) based phoneme recognition method by incor...
Abstract—Recently, context-dependent deep neural network hidden Markov models (CD-DNN-HMMs) have bee...
Recently, deep neural networks (DNNs) have outperformed traditional acoustic models on a variety of ...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
To make full use of a small development data set to build a robust dialectal Chinese speech recogniz...
Abstract—In acoustic modeling, speaker adaptive training (SAT) has been a long-standing technique fo...