International audienceMost of the speech processing applications use triangular filters spaced in mel-scale for feature extraction. In this paper, we propose a new data-driven filter design method which optimizes filter parameters from a given speech data. First, we introduce a frame-selection based approach for developing speech-signal-based frequency warping scale. Then, we propose a new method for computing the filter frequency responses by using principal component analysis (PCA). The main advantage of the proposed method over the recently introduced deep learning based methods is that it requires very limited amount of unlabeled speech-data. We demonstrate that the proposed filterbank has more speaker discriminative power than commonly...
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-...
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-b...
markskow,harris @ cnel.ufl.edu The most popular speech feature extractor used in auto-matic speech r...
AbstractSpeaker identification system identifies the person by his/her speech sample. Speaker Identi...
The most popular speech feature extractor used in automatic speech recognition (ASR) systems today i...
Speaker recognition is defined as to make sure that if the person is the same person he claims to be...
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but t...
Identity verification or biometric recognition systems play an important role in our dailylives. App...
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to im...
Interfering noise severely degrades the performance of a speaker verification system. The Parallel M...
The standard approach to speaker verification is to extract cepstral features from the speech spectr...
Automatic speaker recognition algorithms typically use pre-defined filterbanks, such as Mel-Frequenc...
This thesis describes the development of a robust automatic speaker verification system (ASV) with s...
Traditional and current speaker recognition systems primarily use low-level (physiological) features...
International audienceModern automatic speaker verification relies largely on deep neural networks (...
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-...
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-b...
markskow,harris @ cnel.ufl.edu The most popular speech feature extractor used in auto-matic speech r...
AbstractSpeaker identification system identifies the person by his/her speech sample. Speaker Identi...
The most popular speech feature extractor used in automatic speech recognition (ASR) systems today i...
Speaker recognition is defined as to make sure that if the person is the same person he claims to be...
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but t...
Identity verification or biometric recognition systems play an important role in our dailylives. App...
Advancements in automatic speaker verification (ASV) can be considered to be primarily limited to im...
Interfering noise severely degrades the performance of a speaker verification system. The Parallel M...
The standard approach to speaker verification is to extract cepstral features from the speech spectr...
Automatic speaker recognition algorithms typically use pre-defined filterbanks, such as Mel-Frequenc...
This thesis describes the development of a robust automatic speaker verification system (ASV) with s...
Traditional and current speaker recognition systems primarily use low-level (physiological) features...
International audienceModern automatic speaker verification relies largely on deep neural networks (...
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-...
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-b...
markskow,harris @ cnel.ufl.edu The most popular speech feature extractor used in auto-matic speech r...