This paper describes the IDLab submission for the text-independent task of the Short-duration Speaker Verification Challenge 2021 (SdSVC-21). This speaker verification competition focuses on short duration test recordings and cross-lingual trials, along with the constraint of limited availability of in-domain DeepMine Farsi training data. Currently, both Time Delay Neural Networks (TDNNs) and ResNets achieve state-of-the-art results in speaker verification. These architectures are structurally very different and the construction of hybrid networks looks a promising way forward. We introduce a 2D convolutional stem in a strong ECAPA-TDNN baseline to transfer some of the strong characteristics of a ResNet based model to this hybrid CNN-TDNN a...
The objective of this paper is speaker recognition `in the wild' - where utterances may be of variab...
In this technical report, we describe the Royalflush submissions for the VoxCeleb Speaker Recognitio...
This paper explores novel ideas in building end-to-end deep neural network (DNN) based text-dependen...
Current speaker verification techniques rely on a neural network to extract speaker representations....
Under the short utterance environment, the total variability space underestimates the distribution o...
Convolutional neural networks (CNNs) have significantly promoted the development of speaker verifica...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
Time delay neural networks (TDNNs) are an effective acoustic model for large vocabulary speech recog...
In this paper, we provide description of our submitted systems to the Short Duration Speaker Verific...
In speaker recognition tasks, convolutional neural network (CNN)-based approaches have shown signifi...
To extract accurate speaker information for text-independent speaker verification, temporal dynamic ...
\Lambda, sundarg.iitm.ernet.in Abstract In this paper, we propose two neural network-based approache...
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-depen...
This paper presents an improved deep embedding learning method based on convolutional neural network...
The objective of this paper is speaker recognition `in the wild' - where utterances may be of variab...
In this technical report, we describe the Royalflush submissions for the VoxCeleb Speaker Recognitio...
This paper explores novel ideas in building end-to-end deep neural network (DNN) based text-dependen...
Current speaker verification techniques rely on a neural network to extract speaker representations....
Under the short utterance environment, the total variability space underestimates the distribution o...
Convolutional neural networks (CNNs) have significantly promoted the development of speaker verifica...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
Time delay neural networks (TDNNs) are an effective acoustic model for large vocabulary speech recog...
In this paper, we provide description of our submitted systems to the Short Duration Speaker Verific...
In speaker recognition tasks, convolutional neural network (CNN)-based approaches have shown signifi...
To extract accurate speaker information for text-independent speaker verification, temporal dynamic ...
\Lambda, sundarg.iitm.ernet.in Abstract In this paper, we propose two neural network-based approache...
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-depen...
This paper presents an improved deep embedding learning method based on convolutional neural network...
The objective of this paper is speaker recognition `in the wild' - where utterances may be of variab...
In this technical report, we describe the Royalflush submissions for the VoxCeleb Speaker Recognitio...
This paper explores novel ideas in building end-to-end deep neural network (DNN) based text-dependen...