Convolutional neural networks (CNNs) have significantly promoted the development of speaker verification (SV) systems because of their powerful deep feature learning capability. In CNN-based SV systems, utterance-level aggregation is an important component, and it compresses the frame-level features generated by the CNN frontend into an utterance-level representation. However, most of the existing aggregation methods aggregate the extracted features across time and cannot capture the speaker-dependent information contained in the frequency domain. To handle this problem, this paper proposes a novel attention-based frequency aggregation method, which focuses on the key frequency bands that provide more information for utterance-level represe...
Despite achieving satisfactory performance in speaker verification using deep neural networks, varia...
Most state-of-the-art Deep Learning (DL) approaches forspeaker recognition work on a short utterance...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
The objective of this paper is speaker recognition `in the wild' - where utterances may be of variab...
This paper presents an improved deep embedding learning method based on convolutional neural network...
In speaker recognition tasks, convolutional neural network (CNN)-based approaches have shown signifi...
Current speaker verification techniques rely on a neural network to extract speaker representations....
Current speaker verification techniques rely on a neural network to extract speaker representations....
The objective of this work is speaker recognition under noisy and unconstrained conditions. We make ...
This paper describes the IDLab submission for the text-independent task of the Short-duration Speake...
The objective of this work is speaker recognition under noisy and unconstrained conditions. We make ...
To extract accurate speaker information for text-independent speaker verification, temporal dynamic ...
Most state-of-the-art Deep Learning (DL) approaches forspeaker recognition work on a short utterance...
Most state-of-the-art Deep Learning systems for text-independent speaker verification are based on s...
Speaker identification with deep learning commonly use time-frequency representation of the voice si...
Despite achieving satisfactory performance in speaker verification using deep neural networks, varia...
Most state-of-the-art Deep Learning (DL) approaches forspeaker recognition work on a short utterance...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...
The objective of this paper is speaker recognition `in the wild' - where utterances may be of variab...
This paper presents an improved deep embedding learning method based on convolutional neural network...
In speaker recognition tasks, convolutional neural network (CNN)-based approaches have shown signifi...
Current speaker verification techniques rely on a neural network to extract speaker representations....
Current speaker verification techniques rely on a neural network to extract speaker representations....
The objective of this work is speaker recognition under noisy and unconstrained conditions. We make ...
This paper describes the IDLab submission for the text-independent task of the Short-duration Speake...
The objective of this work is speaker recognition under noisy and unconstrained conditions. We make ...
To extract accurate speaker information for text-independent speaker verification, temporal dynamic ...
Most state-of-the-art Deep Learning (DL) approaches forspeaker recognition work on a short utterance...
Most state-of-the-art Deep Learning systems for text-independent speaker verification are based on s...
Speaker identification with deep learning commonly use time-frequency representation of the voice si...
Despite achieving satisfactory performance in speaker verification using deep neural networks, varia...
Most state-of-the-art Deep Learning (DL) approaches forspeaker recognition work on a short utterance...
Speaker Recognition (SR) is a common task in AI-based sound analysis, involving structurally differe...