Speaker verification (SV) is a task to verify a claimed identity from the voice signal. A well-performing SV system requires a method to transform a variable-length recording into a fixed-length representation (a.k.a. embedding vector), which compactly captures distinctive features over different speakers. There are two popular methods: i-vector and x-vector. Although i-vector is still used, x-vector outperforms i-vector in most SV tasks. However, x-vector has limitations: 1) the embedding still includes information about phones, and 2) it cannot leverage data without speaker labels. Phone information remaining in x-vector can degrade its performance in text-independent SV because utterances from the same speaker may have different embedd...
While promising performance for speaker verification has been achieved by deep speaker embeddings, t...
In this paper, a hierarchical attention network is proposed to generate robust utterance-level embed...
Speaker embeddings represent a means to extract representative vectorial representations from a spee...
In the recent past, Deep neural networks became the most successful approach to extract the speaker ...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
International audienceThis paper presents an overview of a state-of-the-art text-independent speaker...
International audienceThis paper presents an overview of a state-of-the-art text-independent speaker...
International audienceThis paper presents an overview of a state-of-the-art text-independent speaker...
International audienceThis paper presents an overview of a state-of-the-art text-independent speaker...
International audienceThis paper presents an overview of a state-of-the-art text-independent speaker...
In this work we improve the performance of a speaker verification system by matching the feature vec...
State-of-the-art speaker verification systems are inherently dependent on some kind of human supervi...
The aim of this work is to gain insights into how the deep neural network (DNN) models should be tra...
This paper presents an overview of a state-of-the-art text-independent speaker verification system. ...
The aim of this work is to gain insights into how the deep neural network (DNN) models should be tra...
While promising performance for speaker verification has been achieved by deep speaker embeddings, t...
In this paper, a hierarchical attention network is proposed to generate robust utterance-level embed...
Speaker embeddings represent a means to extract representative vectorial representations from a spee...
In the recent past, Deep neural networks became the most successful approach to extract the speaker ...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
International audienceThis paper presents an overview of a state-of-the-art text-independent speaker...
International audienceThis paper presents an overview of a state-of-the-art text-independent speaker...
International audienceThis paper presents an overview of a state-of-the-art text-independent speaker...
International audienceThis paper presents an overview of a state-of-the-art text-independent speaker...
International audienceThis paper presents an overview of a state-of-the-art text-independent speaker...
In this work we improve the performance of a speaker verification system by matching the feature vec...
State-of-the-art speaker verification systems are inherently dependent on some kind of human supervi...
The aim of this work is to gain insights into how the deep neural network (DNN) models should be tra...
This paper presents an overview of a state-of-the-art text-independent speaker verification system. ...
The aim of this work is to gain insights into how the deep neural network (DNN) models should be tra...
While promising performance for speaker verification has been achieved by deep speaker embeddings, t...
In this paper, a hierarchical attention network is proposed to generate robust utterance-level embed...
Speaker embeddings represent a means to extract representative vectorial representations from a spee...