DNN-based speaker verification (SV) models demonstrate significant performance at relatively high computation costs. Model compression can be applied to reduce the model size for lower resource consumption. The present study exploits weight quantization to compress two widely-used SV models, namely ECAPA-TDNN and ResNet. Experimental results on VoxCeleb show that weight quantization is effective for compressing SV models. The model size can be reduced multiple times without noticeable degradation in performance. Compression of ResNet shows more robust results than ECAPA-TDNN with lower-bitwidth quantization. Analysis of the layer weights suggests that the smooth weight distribution of ResNet may be related to its better robustness. The gene...
Speaker verification (SV) provides billions of voice-enabled devices with access control, and ensure...
Deep neural networks (DNN) have achieved impressive success in multiple domains. Over the years, the...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
State-of-the-art speaker verification systems are inherently dependent on some kind of human supervi...
In recent years, self-supervised learning paradigm has received extensive attention due to its great...
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-depen...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
Deep neural networks (DNNs) continue to make significant advances, solving tasks from image classifi...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
Recent advances with self-supervised learning have allowed speech recognition systems to achieve sta...
Data augmentation is vital to the generalization ability and robustness of deep neural networks (DNN...
Current speaker verification techniques rely on a neural network to extract speaker representations....
This paper presents an improved deep embedding learning method based on convolutional neural network...
This paper proposes weight regularization for a faster neural vocoder. Pruning time-consuming DNN mo...
This report describes our speaker verification systems for the tasks of the CN-Celeb Speaker Recogni...
Speaker verification (SV) provides billions of voice-enabled devices with access control, and ensure...
Deep neural networks (DNN) have achieved impressive success in multiple domains. Over the years, the...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...
State-of-the-art speaker verification systems are inherently dependent on some kind of human supervi...
In recent years, self-supervised learning paradigm has received extensive attention due to its great...
In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-depen...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
Deep neural networks (DNNs) continue to make significant advances, solving tasks from image classifi...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
Recent advances with self-supervised learning have allowed speech recognition systems to achieve sta...
Data augmentation is vital to the generalization ability and robustness of deep neural networks (DNN...
Current speaker verification techniques rely on a neural network to extract speaker representations....
This paper presents an improved deep embedding learning method based on convolutional neural network...
This paper proposes weight regularization for a faster neural vocoder. Pruning time-consuming DNN mo...
This report describes our speaker verification systems for the tasks of the CN-Celeb Speaker Recogni...
Speaker verification (SV) provides billions of voice-enabled devices with access control, and ensure...
Deep neural networks (DNN) have achieved impressive success in multiple domains. Over the years, the...
• Implement a high-accuracy text-dependent/short-duration speaker id system • Exploit Deep Neural Ne...