The aim of this work is to gain insights into how the deep neural network (DNN) models should be trained for short utterance evaluation conditions in an x-vector based speaker verification system. The study suggests that the speaker embedding can be extracted with reduced dimensions for short utterance evaluation conditions. When the speaker embedding is extracted from deeper layer which has lower dimension, the x-vector system achieves 14% relative improvement over baseline approach on EER on NIST2010 5sec-5sec truncated conditions. We surmise that since short utterances have less phonetic information speaker discriminative x-vectors can be extracted from a deeper layer of the DNN which captures less phonetic information. Another interesti...
A significant amount of speech is typically required for speaker verification system development and...
In typical x-vector-based speaker recognition systems, standard linear discriminant analysis (LDA) i...
Speaker recognition is one of the field topics widely used in the field of speech technology, many r...
The aim of this work is to gain insights into how the deep neural network (DNN) models should be tra...
In the recent past, Deep neural networks became the most successful approach to extract the speaker ...
This paper proposes techniques to improve the performance of i-vector based speaker verification sys...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
Robust speaker verification on short utterances remains a key consideration when deploying automatic...
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-depen...
Proceedings of Interspeech 2013, Lyon (France)A significant amount of speech is typically required f...
A significant amount of speech is typically required for speaker verification system development and...
This paper proposes a combination of source-normalized\ud weighted linear discriminant analysis (SN-...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
This paper proposes and evaluates classifiers based on Vocal Tract Length Normalization (VTLN) in a ...
Speaker characterization has always been conditioned by the length of the evaluated utterances. Desp...
A significant amount of speech is typically required for speaker verification system development and...
In typical x-vector-based speaker recognition systems, standard linear discriminant analysis (LDA) i...
Speaker recognition is one of the field topics widely used in the field of speech technology, many r...
The aim of this work is to gain insights into how the deep neural network (DNN) models should be tra...
In the recent past, Deep neural networks became the most successful approach to extract the speaker ...
This paper proposes techniques to improve the performance of i-vector based speaker verification sys...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
Robust speaker verification on short utterances remains a key consideration when deploying automatic...
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-depen...
Proceedings of Interspeech 2013, Lyon (France)A significant amount of speech is typically required f...
A significant amount of speech is typically required for speaker verification system development and...
This paper proposes a combination of source-normalized\ud weighted linear discriminant analysis (SN-...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
This paper proposes and evaluates classifiers based on Vocal Tract Length Normalization (VTLN) in a ...
Speaker characterization has always been conditioned by the length of the evaluated utterances. Desp...
A significant amount of speech is typically required for speaker verification system development and...
In typical x-vector-based speaker recognition systems, standard linear discriminant analysis (LDA) i...
Speaker recognition is one of the field topics widely used in the field of speech technology, many r...