The objective of this work is to study state-of-the-art deep neural networks based speaker verification systems called x-vectors on various conditions, such as wideband and narrowband data and to develop the system, which is robust to unseen language, specific noise or speech codec. This system takes variable length audio recording and maps it into fixed length embedding which is afterward used to represent the speaker. We compared our systems to BUT's submission to Speakers in the Wild Speaker Recognition Challenge (SITW) from 2016, which used previously popular statistical models - i-vectors. We observed, that when comparing single best systems, with recently published x-vectors we were able to obtain more than 4.38 times lower Equal Erro...
In the recent past, Deep neural networks became the most successful approach to extract the speaker ...
We examine the use of Deep Neural Networks (DNN) in extracting Baum-Welch statistics for i-vector-ba...
The lack of labeled background data makes a big performance gap between cosine and Probabilistic Lin...
Tématem této práce je analýza nejmodernějších systémů pro rozpoznávání řečníka za použití neurónovýc...
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-depen...
This paper presents the QUT speaker recognition system, as a competing system in the Speakers In The...
This paper presents the QUT speaker recognition system, as a competing system in the Speakers In The...
Speaker recognition is one of the field topics widely used in the field of speech technology, many r...
The aim of this work is to gain insights into how the deep neural network (DNN) models should be tra...
The performance of speaker recognition systems has considerably improved in the last decade. This is...
In this paper, we address the problem of speaker verification in conditions unseen or unknown during...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
Learning robust speaker embeddings is a crucial step in speaker diarization. Deep neural networks ca...
<p>This paper describes the Intelligent Voice (IV) speaker diarization system for IberSPEECH-RTVE 20...
International audienceSpeaker verification (SV) suffers from unsatisfactory performance in far-field...
In the recent past, Deep neural networks became the most successful approach to extract the speaker ...
We examine the use of Deep Neural Networks (DNN) in extracting Baum-Welch statistics for i-vector-ba...
The lack of labeled background data makes a big performance gap between cosine and Probabilistic Lin...
Tématem této práce je analýza nejmodernějších systémů pro rozpoznávání řečníka za použití neurónovýc...
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-depen...
This paper presents the QUT speaker recognition system, as a competing system in the Speakers In The...
This paper presents the QUT speaker recognition system, as a competing system in the Speakers In The...
Speaker recognition is one of the field topics widely used in the field of speech technology, many r...
The aim of this work is to gain insights into how the deep neural network (DNN) models should be tra...
The performance of speaker recognition systems has considerably improved in the last decade. This is...
In this paper, we address the problem of speaker verification in conditions unseen or unknown during...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
Learning robust speaker embeddings is a crucial step in speaker diarization. Deep neural networks ca...
<p>This paper describes the Intelligent Voice (IV) speaker diarization system for IberSPEECH-RTVE 20...
International audienceSpeaker verification (SV) suffers from unsatisfactory performance in far-field...
In the recent past, Deep neural networks became the most successful approach to extract the speaker ...
We examine the use of Deep Neural Networks (DNN) in extracting Baum-Welch statistics for i-vector-ba...
The lack of labeled background data makes a big performance gap between cosine and Probabilistic Lin...