Deep learning and neural network research has grown significantly in the fields of automatic speech recognition (ASR) and speaker recognition. Compared to traditional methods, deep learning-based approaches are more powerful in learning representation from data and building complex models. In this dissertation, we focus on representation learning and modeling using neural network-based approaches for speech and speaker recognition. In the first part of the dissertation, we present two novel neural network-based methods to learn speaker-specific and phoneme-invariant features for short-utterance speaker verification. We first propose to learn a spectral feature mapping from each speech signal to the corresponding subglottal acoustic signal w...
International audienceMost state-of-the-art speech systems use deep neural networks (DNNs). These sy...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
Recently, automatic speech recognition (ASR) systems that use deep neural networks (DNNs) for acoust...
Conventional speech recognition systems consist of feature extraction, acoustic and language modelin...
Representation learning is a fundamental ingredient of deep learning. However, learning a good repre...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
This thesis makes three main contributions to the area of speech recognition with Deep Neural Networ...
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-depen...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
The field of digital speech processing may be divided into three distinct and somewhat in- depend...
A method for speaker normalization in deep neural network (DNN) based discriminative feature estimat...
The recent speaker embeddings framework has been shown to provide excellent performance on the task ...
International audienceMost state-of-the-art speech systems use deep neural networks (DNNs). These sy...
International audienceMost state-of-the-art speech systems use deep neural networks (DNNs). These sy...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...
Automatic speech recognition (ASR) is a key core technology for the information age. ASR systems hav...
Recently, automatic speech recognition (ASR) systems that use deep neural networks (DNNs) for acoust...
Conventional speech recognition systems consist of feature extraction, acoustic and language modelin...
Representation learning is a fundamental ingredient of deep learning. However, learning a good repre...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
This thesis makes three main contributions to the area of speech recognition with Deep Neural Networ...
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-depen...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
The field of digital speech processing may be divided into three distinct and somewhat in- depend...
A method for speaker normalization in deep neural network (DNN) based discriminative feature estimat...
The recent speaker embeddings framework has been shown to provide excellent performance on the task ...
International audienceMost state-of-the-art speech systems use deep neural networks (DNNs). These sy...
International audienceMost state-of-the-art speech systems use deep neural networks (DNNs). These sy...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
The introduction of deep neural networks (DNNs) has advanced the performance of automatic speech rec...