Modern speaker verification models use deep neural networks to encode utterance audio into discriminative embedding vectors. During the training process, these networks are typically optimized to differentiate arbitrary speakers. This learning process biases the learning of fine voice characteristics towards dominant demographic groups, which can lead to an unfair performance disparity across different groups. This is observed especially with underrepresented demographic groups sharing similar voice characteristics. In this work, we investigate the fairness of speaker verification models on controlled datasets with imbalanced gender distributions, providing direct evidence that model performance suffers for underrepresented groups. To mitig...
Federated learning (FL) has garnered considerable attention due to its privacy-preserving feature. N...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
While promising performance for speaker verification has been achieved by deep speaker embeddings, t...
The human voice conveys unique characteristics of an individual, making voice biometrics a key techn...
Speaker recognition systems are playing a key role in modern online applications. Though the suscept...
Despite the success of deep neural networks (DNNs) in enabling on-device voice assistants, increasin...
We address performance fairness for speaker verification using the adversarial reweighting (ARW) met...
To allow individuals to complete voice-based tasks (e.g., send messages or make payments), modern au...
Effective speaker identification is essential for achieving robust speaker recognition in real-world...
We show that deep networks trained to satisfy demographic parity often do so through a form of race ...
Speaker verification (SV) provides billions of voice-enabled devices with access control, and ensure...
Recent work has emphasized the importance of balancing competing objectives in model training (e.g.,...
Data augmentation is vital to the generalization ability and robustness of deep neural networks (DNN...
Speaker embeddings extracted from neural network (NN) achieve excellent performance on general speak...
Data-driven predictive solutions predominant in commercial applications tend to suffer from biases a...
Federated learning (FL) has garnered considerable attention due to its privacy-preserving feature. N...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
While promising performance for speaker verification has been achieved by deep speaker embeddings, t...
The human voice conveys unique characteristics of an individual, making voice biometrics a key techn...
Speaker recognition systems are playing a key role in modern online applications. Though the suscept...
Despite the success of deep neural networks (DNNs) in enabling on-device voice assistants, increasin...
We address performance fairness for speaker verification using the adversarial reweighting (ARW) met...
To allow individuals to complete voice-based tasks (e.g., send messages or make payments), modern au...
Effective speaker identification is essential for achieving robust speaker recognition in real-world...
We show that deep networks trained to satisfy demographic parity often do so through a form of race ...
Speaker verification (SV) provides billions of voice-enabled devices with access control, and ensure...
Recent work has emphasized the importance of balancing competing objectives in model training (e.g.,...
Data augmentation is vital to the generalization ability and robustness of deep neural networks (DNN...
Speaker embeddings extracted from neural network (NN) achieve excellent performance on general speak...
Data-driven predictive solutions predominant in commercial applications tend to suffer from biases a...
Federated learning (FL) has garnered considerable attention due to its privacy-preserving feature. N...
The objective of this work is to study state-of-the-art deep neural networks based speaker verificat...
While promising performance for speaker verification has been achieved by deep speaker embeddings, t...