The effectiveness of higher-order spectral (HOS) phase features in speaker recognition is investigated by comparison with Mel Cepstral features on the same speech data. HOS phase features retain phase information from the Fourier spectrum unlikeMel–frequency Cepstral coefficients (MFCC). Gaussian mixture models are constructed from Mel– Cepstral features and HOS features, respectively, for the same data from various speakers in the Switchboard telephone Speech Corpus. Feature clusters, model parameters and classification performance are analyzed. HOS phase features on their own provide a correct identification rate of about 97% on the chosen subset of the corpus. This is the same level of accuracy as provided by MFCCs. Cluster plots and mod...
This paper describes a unique cross-phoneme speaker identification experiment, using deliberately mi...
This paper investigates the task of SR (Speaker Recognition) for the state-of-the-art techniques. Th...
We propose a novel feature set for speaker recognition that is based on the voice source signal. The...
The objective of this letter is to demonstrate the complementary nature of speaker-specific informat...
Mel-frequency cepstral coefficients (MFCCs) are widely adopted in speech recognition as well as spea...
Speaker recognition system is intended to recognize a person’s identity. This task can be done by kn...
A state of the art Speaker Identification (SI) system requires a robust feature extraction unit foll...
AbstractSpeaker identification system identifies the person by his/her speech sample. Speaker Identi...
Despite recent advances, improving the accuracy of automatic speaker recognition systems remains an ...
www.imm.dtu.dk This Master’s thesis presents an investigation of the features and models used when c...
fusion, prosodic features Abstract—As the digital divide between man and information decreases, pers...
The idea of the Speaker Recognition Project is to implement a recognizer which might determine an in...
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-b...
For several reasons, the Fourier phase domain is less favored than the magnitude domain in signal pr...
Gaussian mixture models (GMMs) are commonly used in text-independent speaker identification systems....
This paper describes a unique cross-phoneme speaker identification experiment, using deliberately mi...
This paper investigates the task of SR (Speaker Recognition) for the state-of-the-art techniques. Th...
We propose a novel feature set for speaker recognition that is based on the voice source signal. The...
The objective of this letter is to demonstrate the complementary nature of speaker-specific informat...
Mel-frequency cepstral coefficients (MFCCs) are widely adopted in speech recognition as well as spea...
Speaker recognition system is intended to recognize a person’s identity. This task can be done by kn...
A state of the art Speaker Identification (SI) system requires a robust feature extraction unit foll...
AbstractSpeaker identification system identifies the person by his/her speech sample. Speaker Identi...
Despite recent advances, improving the accuracy of automatic speaker recognition systems remains an ...
www.imm.dtu.dk This Master’s thesis presents an investigation of the features and models used when c...
fusion, prosodic features Abstract—As the digital divide between man and information decreases, pers...
The idea of the Speaker Recognition Project is to implement a recognizer which might determine an in...
The spectral parameters that result from filtering the frequency sequence of log mel-scaled filter-b...
For several reasons, the Fourier phase domain is less favored than the magnitude domain in signal pr...
Gaussian mixture models (GMMs) are commonly used in text-independent speaker identification systems....
This paper describes a unique cross-phoneme speaker identification experiment, using deliberately mi...
This paper investigates the task of SR (Speaker Recognition) for the state-of-the-art techniques. Th...
We propose a novel feature set for speaker recognition that is based on the voice source signal. The...