In this work we assess the recently proposed hybrid Deep Neural Network/Gaussian Mixture Model (DNN/GMM) approach for speaker recognition considering the effects of the granularity of the phonetic DNN model, and of the precision of the corresponding GMM models, which will be referred to as the phonetic GMMs. The aim of this work is to better understand the contributions of the phonetic information provided by the DNN model with respect to the accuracy of the acous tic GMMs in fitting the distribution of the features associated to a given context-dependent phone state. The testbed for this work was the text-independent speaker recognition task defined by NIST for the 2012 Speaker Recognition Evaluation. Our experiment confirms that the acous...
This paper proposes some methods of robust text-independent speaker identification based on Gaussian...
This paper describes a GMM-based speaker verification system that uses speaker-dependent background ...
In Deep Neural Network (DNN) i-vector based speaker recognition systems, acoustic models trained for...
In this work we assess the recently proposed hybrid Deep Neural Network/Gaussian Mixture Model (DNN/...
Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are the state-of-the-art for acoustic modelin...
Speech recognition has been an important sector of research to enhance the user interaction with mac...
Despite intuitive expectation and experimental evidence that phonemes contain useful speaker discrim...
Deep neural networks (DNNs) have been shown to outperform Gaussian Mixture Models (GMM) on a variety...
Deep neural networks (DNNs) have been shown to outperform Gaussian Mixture Models (GMM) on a variety...
This chapter describes anchor model-based speaker recognition with phonetic modeling. Gaussian Mixtu...
Speaker recognition is a biometric operation of accepting a claimed person based on analyzing his sp...
The recent speaker embeddings framework has been shown to provide excellent performance on the task ...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
The problem of establishing the identity of a speaker from a given utterance has been conventionally...
Speaker recognition is one of the field topics widely used in the field of speech technology, many r...
This paper proposes some methods of robust text-independent speaker identification based on Gaussian...
This paper describes a GMM-based speaker verification system that uses speaker-dependent background ...
In Deep Neural Network (DNN) i-vector based speaker recognition systems, acoustic models trained for...
In this work we assess the recently proposed hybrid Deep Neural Network/Gaussian Mixture Model (DNN/...
Gaussian Mixture Model-Hidden Markov Models (GMM-HMMs) are the state-of-the-art for acoustic modelin...
Speech recognition has been an important sector of research to enhance the user interaction with mac...
Despite intuitive expectation and experimental evidence that phonemes contain useful speaker discrim...
Deep neural networks (DNNs) have been shown to outperform Gaussian Mixture Models (GMM) on a variety...
Deep neural networks (DNNs) have been shown to outperform Gaussian Mixture Models (GMM) on a variety...
This chapter describes anchor model-based speaker recognition with phonetic modeling. Gaussian Mixtu...
Speaker recognition is a biometric operation of accepting a claimed person based on analyzing his sp...
The recent speaker embeddings framework has been shown to provide excellent performance on the task ...
Recently, deep learning techniques have been successfully applied to automatic speech recognition (A...
The problem of establishing the identity of a speaker from a given utterance has been conventionally...
Speaker recognition is one of the field topics widely used in the field of speech technology, many r...
This paper proposes some methods of robust text-independent speaker identification based on Gaussian...
This paper describes a GMM-based speaker verification system that uses speaker-dependent background ...
In Deep Neural Network (DNN) i-vector based speaker recognition systems, acoustic models trained for...