This paper presents an effective technique for clustering speech utterances based on their associated speaker. In attempts to determine which utterances are from the same speakers, a prerequisite is to measure the similarity of voice characteristics between utterances. Since the vast majority of existing methods evaluate the inter-utterance similarity by taking only the information from the spectrum-based features of utterance pairs into account, the resulting clusters may not be well relevant to speaker, but instead likely to the environmental conditions or other acoustic classes. To compensate for this shortcoming, this study proposes to project utterances from their spectrum-based feature representation onto a reference space trained to ...
In the context of the Neologos French speech database creation project, we have defined a general me...
International audienceIn the context of the Neologos French speech database creation project, we hav...
Gaussian mixture models (GMMs) are commonly used in text-independent speaker identification systems....
This paper investigates the problem of automatically grouping unknown speech utterances based on the...
This study investigates the problem of automatically grouping unknown speech utterances based on the...
Forensic speaker recognition (FSR) is the process of determining whether the source of a questioned ...
UnrestrictedSpeaker clustering refers to a process of classifying a set of input speech data (or spe...
Proceedings of the 3rd Nordic Symposium on Multimodal Communication. Editors: Patrizia Paggio, Elis...
In the context of the Neologos French speech database creation project, a general methodology was de...
In the context of the Neologos French speech database creation project, a general methodology was de...
In the context of the Neologos French speech database creation project, a general methodology was de...
In the context of the Neologos French speech database creation project, a general methodology was de...
In the context of the Neologos French speech database creation project, a general methodology was de...
Forensic speaker recognition (FSR) is the process of determining whether the source of a questioned ...
Overlapped speech, where several speakers are speaking simultaneously, is a common occurence in mult...
In the context of the Neologos French speech database creation project, we have defined a general me...
International audienceIn the context of the Neologos French speech database creation project, we hav...
Gaussian mixture models (GMMs) are commonly used in text-independent speaker identification systems....
This paper investigates the problem of automatically grouping unknown speech utterances based on the...
This study investigates the problem of automatically grouping unknown speech utterances based on the...
Forensic speaker recognition (FSR) is the process of determining whether the source of a questioned ...
UnrestrictedSpeaker clustering refers to a process of classifying a set of input speech data (or spe...
Proceedings of the 3rd Nordic Symposium on Multimodal Communication. Editors: Patrizia Paggio, Elis...
In the context of the Neologos French speech database creation project, a general methodology was de...
In the context of the Neologos French speech database creation project, a general methodology was de...
In the context of the Neologos French speech database creation project, a general methodology was de...
In the context of the Neologos French speech database creation project, a general methodology was de...
In the context of the Neologos French speech database creation project, a general methodology was de...
Forensic speaker recognition (FSR) is the process of determining whether the source of a questioned ...
Overlapped speech, where several speakers are speaking simultaneously, is a common occurence in mult...
In the context of the Neologos French speech database creation project, we have defined a general me...
International audienceIn the context of the Neologos French speech database creation project, we hav...
Gaussian mixture models (GMMs) are commonly used in text-independent speaker identification systems....