Hybrid deep neural network-hidden Markov model (DNN-HMM) systems have become the state-of-the-art in automatic speech recognition. In this paper we experiment with DNN-HMM phone recognition systems that use measured articulatory information. Deep neural networks are both used to compute phone posterior probabilities and to perform acoustic-to-articulatory mapping (AAM). The AAM processes we propose are based on deep representations of the acoustic and the articulatory domains. Such representations allow to: (i) create different pre-training configurations of the DNNs that perform AAM; (ii) perform AAM on a transformed (through DNN autoencoders) articulatory feature (AF) space that captures strong statistical dependencies between articulator...
Abstract Most of the current state-of-the-art speech recognition systems are based on speech signal ...
This paper presents a method for describing the effect of articulatory trajectories on phoneme recog...
Automatic Speech Recognition (ASR) is a challenging classification task over sequences of acoustic f...
In this paper we experiment with methods based on Deep Belief Networks (DBNs) to recover measured ar...
Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are the state-of-the-art for acoustic modelin...
This thesis makes three main contributions to the area of speech recognition with Deep Neural Networ...
Acoustic modeling in state-of-the-art speech recognition systems usually relies on hidden Markov mod...
Hidden Markov models (HMMs) have been the mainstream acoustic modelling approach for state-of-the-ar...
This paper presents a deep neural network (DNN) to extract articulatory information from the speech ...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
We investigate the problem of speaker independent acoustic-to-articulatory inversion (AAI) in noisy ...
Neural networks, especially those with more than one hidden layer, have re-emerged in Automatic Spee...
In hidden Markov model (HMM) based automatic speech recognition (ASR) system, modeling the statistic...
Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been shown to significa...
Three neural network models were trained on the forward mapping from articulatory positions to acous...
Abstract Most of the current state-of-the-art speech recognition systems are based on speech signal ...
This paper presents a method for describing the effect of articulatory trajectories on phoneme recog...
Automatic Speech Recognition (ASR) is a challenging classification task over sequences of acoustic f...
In this paper we experiment with methods based on Deep Belief Networks (DBNs) to recover measured ar...
Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are the state-of-the-art for acoustic modelin...
This thesis makes three main contributions to the area of speech recognition with Deep Neural Networ...
Acoustic modeling in state-of-the-art speech recognition systems usually relies on hidden Markov mod...
Hidden Markov models (HMMs) have been the mainstream acoustic modelling approach for state-of-the-ar...
This paper presents a deep neural network (DNN) to extract articulatory information from the speech ...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
We investigate the problem of speaker independent acoustic-to-articulatory inversion (AAI) in noisy ...
Neural networks, especially those with more than one hidden layer, have re-emerged in Automatic Spee...
In hidden Markov model (HMM) based automatic speech recognition (ASR) system, modeling the statistic...
Recently, the hybrid deep neural network (DNN)-hidden Markov model (HMM) has been shown to significa...
Three neural network models were trained on the forward mapping from articulatory positions to acous...
Abstract Most of the current state-of-the-art speech recognition systems are based on speech signal ...
This paper presents a method for describing the effect of articulatory trajectories on phoneme recog...
Automatic Speech Recognition (ASR) is a challenging classification task over sequences of acoustic f...