In this paper, a hierarchical attention network is proposed to generate robust utterance-level embeddings (H-vectors) for speaker identification and verification. Since different parts of an utterance may have different contributions to speaker identities, the use of hierarchical structure aims to learn speaker related information locally and globally. In the proposed approach, frame-level encoder and attention are applied on segments of an input utterance and generate individual segment vectors. Then, segment level attention is applied on the segment vectors to construct an utterance representation. To evaluate the quality of the learned utterance-level speaker embeddings on speaker identification and verification, the proposed approach is...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
Abstract. In this paper, we explore the Input/Output HMh4 (IOHMM) architecture for a substantial pro...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
In this paper, a hierarchical attention network is proposed to generate utterance-level embeddings (...
Identifying multiple speakers without knowing where a speaker’s voice is in a recording is a challen...
In the recent past, Deep neural networks became the most successful approach to extract the speaker ...
Speaker verification (SV) is a task to verify a claimed identity from the voice signal. A well-perfo...
Speaker recognition deals with recognizing speakers by their speech. Most speaker recognition system...
| openaire: EC/H2020/780069/EU//MeMADIn speaker-aware training, a speaker embedding is appended to D...
A modified hierarchical mixtures of experts (HME) architecture is presented for text-dependent speak...
A modified hierarchical mixtures of experts (HME) architecture is presented for text-dependent speak...
Most state-of-the-art Deep Learning systems for text-independent speaker verification are based on s...
Most state-of-the-art Deep Learning (DL) approaches forspeaker recognition work on a short utterance...
In this paper, a novel architecture for speaker recognition is proposed by cascading speech enhancem...
Current speaker verification techniques rely on a neural network to extract speaker representations....
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
Abstract. In this paper, we explore the Input/Output HMh4 (IOHMM) architecture for a substantial pro...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
In this paper, a hierarchical attention network is proposed to generate utterance-level embeddings (...
Identifying multiple speakers without knowing where a speaker’s voice is in a recording is a challen...
In the recent past, Deep neural networks became the most successful approach to extract the speaker ...
Speaker verification (SV) is a task to verify a claimed identity from the voice signal. A well-perfo...
Speaker recognition deals with recognizing speakers by their speech. Most speaker recognition system...
| openaire: EC/H2020/780069/EU//MeMADIn speaker-aware training, a speaker embedding is appended to D...
A modified hierarchical mixtures of experts (HME) architecture is presented for text-dependent speak...
A modified hierarchical mixtures of experts (HME) architecture is presented for text-dependent speak...
Most state-of-the-art Deep Learning systems for text-independent speaker verification are based on s...
Most state-of-the-art Deep Learning (DL) approaches forspeaker recognition work on a short utterance...
In this paper, a novel architecture for speaker recognition is proposed by cascading speech enhancem...
Current speaker verification techniques rely on a neural network to extract speaker representations....
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...
Abstract. In this paper, we explore the Input/Output HMh4 (IOHMM) architecture for a substantial pro...
This paper explores three novel approaches to improve the performance of speaker verification (SV) s...