Text-independent speaker recognition systems such as those based on Gaussian mixture models (GMMs) do not include time sequence information (TSI) within the model itself. The level of importance of TSI in speaker recognition is an interesting question and one addressed in this paper. Recent works has shown that the utilisation of higher-level information such as idiolect, pronunciation, and prosodics can be useful in reducing speaker recognition error rates. In accordance with these developments, the aim of this paper is to show that as more data becomes available, the basic GMM can be enhanced by utilising TSI, even in a text-independent mode. This paper presents experimental work incorporating TSI into the conventional GMM. The resulting...
In this paper, we investigate on the role of dynamic information on the performances of AR-vector mo...
In this paper, we investigate on the role of dynamic information on the performances of AR-vector mo...
The hypothesis that for a given amount of training data a speaker model has an optimum number of com...
In this paper,features for text-independent speaker recognition has been evaluated. Speaker identifi...
We present a method for speaker recognition that uses the duration patterns of speech units to aid s...
This article describes a general and powerful approach to modelling mismatch in speaker recognition ...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
International audienceThe performance of speaker recognition system is highly dependent on the amoun...
Given a speech signal there are two kinds of information that may be extracted from it. On one hand ...
This paper discusses the research directions pursued jointly at the Anthropic Signal Processing Grou...
Conference PaperIn this paper, we investigate on the role of dynamic information on the performances...
This paper develops an effective and efficient scheme to integrate Gaussian mixture model (GMM), sup...
This paper examines the usefulness of a multilingual broad syllable-based framework for text-indepen...
This paper examines the usefulness of a multilingual broad syllable-based framework for text-indepen...
This paper proposes some methods of robust text-independent speaker identification based on Gaussian...
In this paper, we investigate on the role of dynamic information on the performances of AR-vector mo...
In this paper, we investigate on the role of dynamic information on the performances of AR-vector mo...
The hypothesis that for a given amount of training data a speaker model has an optimum number of com...
In this paper,features for text-independent speaker recognition has been evaluated. Speaker identifi...
We present a method for speaker recognition that uses the duration patterns of speech units to aid s...
This article describes a general and powerful approach to modelling mismatch in speaker recognition ...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
International audienceThe performance of speaker recognition system is highly dependent on the amoun...
Given a speech signal there are two kinds of information that may be extracted from it. On one hand ...
This paper discusses the research directions pursued jointly at the Anthropic Signal Processing Grou...
Conference PaperIn this paper, we investigate on the role of dynamic information on the performances...
This paper develops an effective and efficient scheme to integrate Gaussian mixture model (GMM), sup...
This paper examines the usefulness of a multilingual broad syllable-based framework for text-indepen...
This paper examines the usefulness of a multilingual broad syllable-based framework for text-indepen...
This paper proposes some methods of robust text-independent speaker identification based on Gaussian...
In this paper, we investigate on the role of dynamic information on the performances of AR-vector mo...
In this paper, we investigate on the role of dynamic information on the performances of AR-vector mo...
The hypothesis that for a given amount of training data a speaker model has an optimum number of com...