Speaker identification systems in a real-world scenario are tasked to identify a speaker amongst a set of enrolled speakers given just a few samples for each enrolled speaker. This paper demonstrates the effectiveness of meta-learning and relation networks for this use case. We propose improved relation networks for speaker verification and few-shot (unseen) speaker identification. The use of relation networks facilitates joint training of the frontend speaker encoder and the backend model. Inspired by the use of prototypical networks in speaker verification and to increase the discriminability of the speaker embeddings, we train the model to classify samples in the current episode amongst all speakers present in the training set. Furthermo...
Speaker recognition is one of the field topics widely used in the field of speech technology, many r...
Voice cloning is a difficult task which requires robust and informative features incorporated in a h...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
Meta-learning has recently become a research hotspot in speaker verification (SV). We introduce two ...
State-of-the-art speaker verification systems are inherently dependent on some kind of human supervi...
In Automatic Speech Recognition (ASR) the non-linear data projection provided by a one hidden layer ...
In this technical report, we describe the Royalflush submissions for the VoxCeleb Speaker Recognitio...
In recent years, self-supervised learning paradigm has received extensive attention due to its great...
This work considers training neural networks for speaker recognition with a much smaller dataset siz...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to im...
One approach to speaker adaptation for the neural-network acoustic models of a hybrid connectionist-...
Speaker recognition, recognizing speaker identities based on voice alone, enables important downstre...
This paper presents an improved deep embedding learning method based on convolutional neural network...
Effective speaker identification is essential for achieving robust speaker recognition in real-world...
Speaker recognition is one of the field topics widely used in the field of speech technology, many r...
Voice cloning is a difficult task which requires robust and informative features incorporated in a h...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
Meta-learning has recently become a research hotspot in speaker verification (SV). We introduce two ...
State-of-the-art speaker verification systems are inherently dependent on some kind of human supervi...
In Automatic Speech Recognition (ASR) the non-linear data projection provided by a one hidden layer ...
In this technical report, we describe the Royalflush submissions for the VoxCeleb Speaker Recognitio...
In recent years, self-supervised learning paradigm has received extensive attention due to its great...
This work considers training neural networks for speaker recognition with a much smaller dataset siz...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to im...
One approach to speaker adaptation for the neural-network acoustic models of a hybrid connectionist-...
Speaker recognition, recognizing speaker identities based on voice alone, enables important downstre...
This paper presents an improved deep embedding learning method based on convolutional neural network...
Effective speaker identification is essential for achieving robust speaker recognition in real-world...
Speaker recognition is one of the field topics widely used in the field of speech technology, many r...
Voice cloning is a difficult task which requires robust and informative features incorporated in a h...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...