The mismatch between enrollment and test utterances due to different types of variabilities is a great challenge in speaker verification. Based on the observation that the SNR-level variability or channel-type variability causes heterogeneous clusters in i-vector space, this paper proposes to apply supervised learning to drive or guide the learning of probabilistic linear discriminant analysis (PLDA) mixture models. Specifically, a deep neural network (DNN) is trained to produce the posterior probabilities of different SNR levels or channel types given i-vectors as input. These posteriors then replace the posterior probabilities of indicator variables in the mixture of PLDA. The discriminative training causes the mixture model to perform mo...
The i-vector and Joint Factor Analysis (JFA) systems for text- dependent speaker verification use su...
Proceedings of Interspeech 2013, Lyon (France)A significant amount of speech data is required to dev...
In the paper recent methods used in the task of speaker recognition are presented. At first, the ext...
15th Annual Conference of the International Speech Communication Association: Celebrating the Divers...
9th International Symposium on Chinese Spoken Language Processing, ISCSLP 2014, 12-14 September 2014...
Probabilistic Linear Discriminant Analysis (PLDA) is the most efficient backend for i-vectors. Howev...
The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non...
The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non...
With the ubiquitous of mobile phones, users of speaker verification systems will perform authenticat...
The availability of multiple utterances (and hence, i-vectors) for speaker en-rollment brings up sev...
In this paper, we address the problem of speaker verification in conditions unseen or unknown during...
Abstract—The popular i-vector approach to speaker recog-nition represents a speech segment as an i-v...
The lack of labeled background data makes a big performance gap between cosine and Probabilistic Lin...
The performance of state-of-the-art i-vector speaker verification systems relies on a large amount o...
This paper analyses the probabilistic linear discriminant analysis (PLDA) speaker verification appro...
The i-vector and Joint Factor Analysis (JFA) systems for text- dependent speaker verification use su...
Proceedings of Interspeech 2013, Lyon (France)A significant amount of speech data is required to dev...
In the paper recent methods used in the task of speaker recognition are presented. At first, the ext...
15th Annual Conference of the International Speech Communication Association: Celebrating the Divers...
9th International Symposium on Chinese Spoken Language Processing, ISCSLP 2014, 12-14 September 2014...
Probabilistic Linear Discriminant Analysis (PLDA) is the most efficient backend for i-vectors. Howev...
The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non...
The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non...
With the ubiquitous of mobile phones, users of speaker verification systems will perform authenticat...
The availability of multiple utterances (and hence, i-vectors) for speaker en-rollment brings up sev...
In this paper, we address the problem of speaker verification in conditions unseen or unknown during...
Abstract—The popular i-vector approach to speaker recog-nition represents a speech segment as an i-v...
The lack of labeled background data makes a big performance gap between cosine and Probabilistic Lin...
The performance of state-of-the-art i-vector speaker verification systems relies on a large amount o...
This paper analyses the probabilistic linear discriminant analysis (PLDA) speaker verification appro...
The i-vector and Joint Factor Analysis (JFA) systems for text- dependent speaker verification use su...
Proceedings of Interspeech 2013, Lyon (France)A significant amount of speech data is required to dev...
In the paper recent methods used in the task of speaker recognition are presented. At first, the ext...