Performance of automatic speaker verification (ASV) systems is very sensitive to mismatch between training (source) and testing (target) domains. The best way to address domain mismatch is to perform matched condition training – gather sufficient labeled samples from the target domain and use them in training. However, in many cases this is too expensive or impractical. Usually, gaining access to unlabeled target domain data, e.g., from open source online media, and labeled data from other domains is more feasible. This work focuses on making ASV systems robust to uncontrolled (‘wild’) conditions, with the help of some unlabeled data acquired from such conditions. Given acoustic features from both domains, we propose learning a mapping func...
Face recognition has been receiving consistent attention in computer vision community for over three...
The performance of speech recognition systems is known to degrade in mismatched conditions, where th...
International audienceMy talk will focus on robustness to background noise in distant-microphone spe...
With the increasing use of voice as a biometric, it has become imperative to develop countermeasures...
In real-world applications, speaker recognition models often face various domain-mismatch challenges...
Unsupervised domain adaptation using adversarial learning has shown promise in adapting speech model...
Machine learning algorithms have achieved the state-of-the-art results by utilizing deep neural netw...
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to im...
Speech enhancement aims to suppress background noise in noisy speech signals in order to improve spe...
The cross-domain performance of automatic speech recognition (ASR) could be severely hampered due to...
Automatic speech recognition models are often adapted to improve their accuracy in a new domain. A p...
A speaker embeddings framework achieves state-of-the-art speaker recognition performance by modeling...
The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non...
Distribution mismatches between the data seen at training and at application time remain a major cha...
The performance of deep learning approaches to speech enhancement degrades significantly in face of ...
Face recognition has been receiving consistent attention in computer vision community for over three...
The performance of speech recognition systems is known to degrade in mismatched conditions, where th...
International audienceMy talk will focus on robustness to background noise in distant-microphone spe...
With the increasing use of voice as a biometric, it has become imperative to develop countermeasures...
In real-world applications, speaker recognition models often face various domain-mismatch challenges...
Unsupervised domain adaptation using adversarial learning has shown promise in adapting speech model...
Machine learning algorithms have achieved the state-of-the-art results by utilizing deep neural netw...
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to im...
Speech enhancement aims to suppress background noise in noisy speech signals in order to improve spe...
The cross-domain performance of automatic speech recognition (ASR) could be severely hampered due to...
Automatic speech recognition models are often adapted to improve their accuracy in a new domain. A p...
A speaker embeddings framework achieves state-of-the-art speaker recognition performance by modeling...
The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non...
Distribution mismatches between the data seen at training and at application time remain a major cha...
The performance of deep learning approaches to speech enhancement degrades significantly in face of ...
Face recognition has been receiving consistent attention in computer vision community for over three...
The performance of speech recognition systems is known to degrade in mismatched conditions, where th...
International audienceMy talk will focus on robustness to background noise in distant-microphone spe...