International audienceThis paper investigates speaker adaptation techniques for bidirectional long short term memory (BLSTM) recurrent neural network based acoustic models (AMs) trained with the connectionist temporal classification (CTC) objective function.BLSTM-CTC AMs play an important role in end-to-end automatic speech recognition systems.However, there is a lack of research in speaker adaptation algorithms for these models. We explore three different feature-space adaptation approaches for CTC AMs: feature-space maximum linear regression, i-vector based adaptation, and maximum a posteriori adaptation using GMM-derived features.Experimental results on the TED-LIUM corpus demonstrate that speaker adaptation, applied in combination wi...
Abstract—In acoustic modeling, speaker adaptive training (SAT) has been a long-standing technique fo...
Hybrid connectionistfHMM systems model time both using a Markov chain and through properties of a co...
Hybrid connectionist/HMM systems model time using both a Markov chain and through properties of a co...
Long Short-Term Memory (LSTM) is a recurrent neural net-work (RNN) architecture specializing in mode...
This work introduces a multiple connectionist architecture based on a mixture of Recurrent Neural Ne...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
Probabilistic linear discriminant analysis (PLDA) acoustic models extend Gaussian mixture models by ...
Multilingual models for Automatic Speech Recognition (ASR) are attractive as they have been shown to...
Multilingual models for Automatic Speech Recognition (ASR) are attractive as they have been shown to...
This paper proposes a simple yet effective model-based neural network speaker adaptation technique t...
Though speaker adaptation has long been an importing topic in automatic speech recognition, the brea...
This paper addresses speaker adaptive acoustic modeling, based on feature space maximum likelih...
In this paper, a new method called Maximum Likelihood Neural Regression (MLNR) is introduced for Rap...
Automatic speech recognition (ASR) incorporates knowledge and research in linguistics, computer scie...
Abstract—In acoustic modeling, speaker adaptive training (SAT) has been a long-standing technique fo...
Hybrid connectionistfHMM systems model time both using a Markov chain and through properties of a co...
Hybrid connectionist/HMM systems model time using both a Markov chain and through properties of a co...
Long Short-Term Memory (LSTM) is a recurrent neural net-work (RNN) architecture specializing in mode...
This work introduces a multiple connectionist architecture based on a mixture of Recurrent Neural Ne...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
Speaker adaptation of deep neural networks (DNNs) based acoustic models is still a challenging area ...
Probabilistic linear discriminant analysis (PLDA) acoustic models extend Gaussian mixture models by ...
Multilingual models for Automatic Speech Recognition (ASR) are attractive as they have been shown to...
Multilingual models for Automatic Speech Recognition (ASR) are attractive as they have been shown to...
This paper proposes a simple yet effective model-based neural network speaker adaptation technique t...
Though speaker adaptation has long been an importing topic in automatic speech recognition, the brea...
This paper addresses speaker adaptive acoustic modeling, based on feature space maximum likelih...
In this paper, a new method called Maximum Likelihood Neural Regression (MLNR) is introduced for Rap...
Automatic speech recognition (ASR) incorporates knowledge and research in linguistics, computer scie...
Abstract—In acoustic modeling, speaker adaptive training (SAT) has been a long-standing technique fo...
Hybrid connectionistfHMM systems model time both using a Markov chain and through properties of a co...
Hybrid connectionist/HMM systems model time using both a Markov chain and through properties of a co...