International audienceModern automatic speaker verification relies largely on deep neural networks (DNNs) trained on mel-frequency cepstral coefficient (MFCC) features. While there are alternative feature extraction methods based on phase, prosody and long-term temporal operations, they have not been extensively studied with DNN-based methods. We aim to fill this gap by providing extensive re-assessment of 14 feature extractors on VoxCeleb and SITW datasets. Our findings reveal that features equipped with techniques such as spectral centroids, group delay function, and integrated noise suppression provide promising alternatives to MFCCs for deep speaker embeddings extraction. Experimental results demonstrate up to 16.3% (VoxCeleb) and 25.1%...
Automatic speaker recognition algorithms typically use pre-defined filterbanks, such as Mel-Frequenc...
Deep learning and neural network research has grown significantly in the fields of automatic speech ...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
Modern automatic speaker verification relies largely on deep neural networks (DNNs) trained on mel-f...
International audienceWe address far-field speaker verification with deep neural network (DNN) based...
This paper presents an improved deep embedding learning method based on convolutional neural network...
Learning representation from audio data has shown advantages over the handcrafted features such as m...
In this technical report, we describe the Royalflush submissions for the VoxCeleb Speaker Recognitio...
Speaker embeddings represent a means to extract representative vectorial representations from a spee...
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to im...
The performance of the automatic speaker recognition system is becoming more and more accurate, with...
International audienceMulti-taper estimators provide low-variance power spectrum estimates that can ...
Effective speaker identification is essential for achieving robust speaker recognition in real-world...
International audienceThis work tries to investigate the use of a Convolutional Neu-ral Network appr...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
Automatic speaker recognition algorithms typically use pre-defined filterbanks, such as Mel-Frequenc...
Deep learning and neural network research has grown significantly in the fields of automatic speech ...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
Modern automatic speaker verification relies largely on deep neural networks (DNNs) trained on mel-f...
International audienceWe address far-field speaker verification with deep neural network (DNN) based...
This paper presents an improved deep embedding learning method based on convolutional neural network...
Learning representation from audio data has shown advantages over the handcrafted features such as m...
In this technical report, we describe the Royalflush submissions for the VoxCeleb Speaker Recognitio...
Speaker embeddings represent a means to extract representative vectorial representations from a spee...
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to im...
The performance of the automatic speaker recognition system is becoming more and more accurate, with...
International audienceMulti-taper estimators provide low-variance power spectrum estimates that can ...
Effective speaker identification is essential for achieving robust speaker recognition in real-world...
International audienceThis work tries to investigate the use of a Convolutional Neu-ral Network appr...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
Automatic speaker recognition algorithms typically use pre-defined filterbanks, such as Mel-Frequenc...
Deep learning and neural network research has grown significantly in the fields of automatic speech ...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...