In this paper a new linguistically-motivated front-end is presented showing major performance improvements from the use of session variability compensated cepstral trajectories in phone units. Extending our recent work on temporal contours in linguistic units (TCLU), we have combined the potential of those unit-dependent trajectories with the ability of feature domain factor analysis techniques to compensate session variability effects, which has resulted in consistent and discriminant phone-dependent trajectories across different recording sessions. Evaluating with NIST SRE04 English-only 1s1s task, we report EERs as low as 5.40 % from the trajectories in a single phone, with 29 different phones producing each of them EERs smaller than 10%...
Presented is an approach to modelling session variability for GMM-based text-independent speaker ver...
Embedded speaker recognition in mobile devices could involve several ergonomic constraints and a lim...
Many of the language identification (LID) systems are based on language models using machine learnin...
Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses...
Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses...
The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-35292-8_3Proc...
Recent studies show that phonetic sequences from multiple languages can provide effective features f...
We present a method for speaker recognition that uses the duration patterns of speech units to aid s...
We improve upon our measures relating feature vector distri-butions to speaker recognition (SR) perf...
A state-of-the-art automatic speech recognition (ASR) system can often achieve high accuracy for mos...
We explore how intrinsic variations (those associated with the speaker rather than the recording env...
This paper presents a new approach to feature-level phone normalisation which aims to improve speake...
This paper presents new techniques with relevant improvements added to the primary system presented ...
This thesis addresses the problem of speech phone recognition. Phones are the acoustic sounds of spe...
In this paper, we investigate the use of invariant features for speaker recognition. Owing to their ...
Presented is an approach to modelling session variability for GMM-based text-independent speaker ver...
Embedded speaker recognition in mobile devices could involve several ergonomic constraints and a lim...
Many of the language identification (LID) systems are based on language models using machine learnin...
Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses...
Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses...
The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-35292-8_3Proc...
Recent studies show that phonetic sequences from multiple languages can provide effective features f...
We present a method for speaker recognition that uses the duration patterns of speech units to aid s...
We improve upon our measures relating feature vector distri-butions to speaker recognition (SR) perf...
A state-of-the-art automatic speech recognition (ASR) system can often achieve high accuracy for mos...
We explore how intrinsic variations (those associated with the speaker rather than the recording env...
This paper presents a new approach to feature-level phone normalisation which aims to improve speake...
This paper presents new techniques with relevant improvements added to the primary system presented ...
This thesis addresses the problem of speech phone recognition. Phones are the acoustic sounds of spe...
In this paper, we investigate the use of invariant features for speaker recognition. Owing to their ...
Presented is an approach to modelling session variability for GMM-based text-independent speaker ver...
Embedded speaker recognition in mobile devices could involve several ergonomic constraints and a lim...
Many of the language identification (LID) systems are based on language models using machine learnin...