In real-world applications, speaker recognition models often face various domain-mismatch challenges, leading to a significant drop in performance. Although numerous domain adaptation techniques have been developed to address this issue, almost all present methods focus on a simple configuration where the model is trained in one domain and deployed in another. However, real-world environments are often complex and may contain multiple domains, making the methods designed for one-to-one adaptation suboptimal. In our paper, we propose a self-supervised learning method to tackle this multi-domain adaptation problem. Building upon the basic self-supervised adaptation algorithm, we designed three strategies to make it suitable for multi-domain a...
For self-supervised speaker verification, the quality of pseudo labels decides the upper bound of th...
While Automatic Speech Recognition (ASR) models have shown significant advances with the introductio...
Multi-genre speaker recognition is becoming increasingly popular due to its ability to better repres...
The cross-domain performance of automatic speech recognition (ASR) could be severely hampered due to...
Speech distortions are a long-standing problem that degrades the performance of supervisely trained ...
Automatic speech recognition models are often adapted to improve their accuracy in a new domain. A p...
Performance of automatic speaker verification (ASV) systems is very sensitive to mismatch between tr...
In recent years, self-supervised learning paradigm has received extensive attention due to its great...
The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non...
Speaker recognition, recognizing speaker identities based on voice alone, enables important downstre...
Modern speech recognition systems exhibits rapid performance degradation under domain shift. This is...
State-of-the-art speaker verification systems are inherently dependent on some kind of human supervi...
Domain mismatch occurs when data from application-specific target domain is related to, but cannot be...
While self-supervised speech representation learning (SSL) models serve a variety of downstream task...
Unsupervised domain adaptation using adversarial learning has shown promise in adapting speech model...
For self-supervised speaker verification, the quality of pseudo labels decides the upper bound of th...
While Automatic Speech Recognition (ASR) models have shown significant advances with the introductio...
Multi-genre speaker recognition is becoming increasingly popular due to its ability to better repres...
The cross-domain performance of automatic speech recognition (ASR) could be severely hampered due to...
Speech distortions are a long-standing problem that degrades the performance of supervisely trained ...
Automatic speech recognition models are often adapted to improve their accuracy in a new domain. A p...
Performance of automatic speaker verification (ASV) systems is very sensitive to mismatch between tr...
In recent years, self-supervised learning paradigm has received extensive attention due to its great...
The state-of-the-art i-vector based probabilistic linear discriminant analysis (PLDA) trained on non...
Speaker recognition, recognizing speaker identities based on voice alone, enables important downstre...
Modern speech recognition systems exhibits rapid performance degradation under domain shift. This is...
State-of-the-art speaker verification systems are inherently dependent on some kind of human supervi...
Domain mismatch occurs when data from application-specific target domain is related to, but cannot be...
While self-supervised speech representation learning (SSL) models serve a variety of downstream task...
Unsupervised domain adaptation using adversarial learning has shown promise in adapting speech model...
For self-supervised speaker verification, the quality of pseudo labels decides the upper bound of th...
While Automatic Speech Recognition (ASR) models have shown significant advances with the introductio...
Multi-genre speaker recognition is becoming increasingly popular due to its ability to better repres...