In the context of the Neologos French speech database creation project, we have defined a general methodology for selecting representative speaker recordings. The selection aims to ensure good coverage of speaker variability while limiting the number of recorded speakers. This makes the resulting database both better suited to the development of recently proposed multi-model methods and cheaper to collect. The methodology performs the selection by optimizing a quality criterion that can be defined in a variety of speaker similarity modeling frameworks. The selection can be performed and validated with respect to a single similarity criterion, using classical clustering methods such as Hierarchical or K-Medians clust...
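As an illustration of selecting representatives by optimizing a quality criterion, the following minimal sketch picks k reference speakers from a pairwise dissimilarity matrix. It is not the project's implementation. It assumes the criterion is the sum, over all speakers, of the dissimilarity to the closest selected speaker, and it uses a k-medoids-style alternating procedure as a stand-in for the K-Medians clustering named above. The function names `coverage_cost` and `select_representatives`, and the toy one-dimensional "speaker positions", are hypothetical.

```python
import random


def coverage_cost(dist, reps):
    """Assumed quality criterion: total dissimilarity between each
    speaker and its closest selected representative."""
    return sum(min(dist[i][r] for r in reps) for i in range(len(dist)))


def select_representatives(dist, k, n_iter=100, seed=0):
    """Pick k representative speakers from a square dissimilarity matrix.

    Assumes dist[i][i] == 0 and dist[i][j] > 0 for i != j, so that every
    representative stays in its own cluster.
    """
    n = len(dist)
    rng = random.Random(seed)
    reps = rng.sample(range(n), k)  # random initial representatives
    for _ in range(n_iter):
        # Assignment step: attach each speaker to its closest representative.
        clusters = {r: [] for r in reps}
        for i in range(n):
            closest = min(reps, key=lambda r: dist[i][r])
            clusters[closest].append(i)
        # Update step: in each cluster, keep the member with the smallest
        # total dissimilarity to the other members.
        new_reps = [
            min(members, key=lambda c: sum(dist[m][c] for m in members))
            for members in clusters.values()
        ]
        if set(new_reps) == set(reps):  # no change: converged
            break
        reps = new_reps
    return sorted(reps), coverage_cost(dist, reps)


# Toy data (hypothetical): six "speakers" placed on a line, two clear groups.
positions = [0, 1, 2, 10, 11, 12]
dist = [[abs(a - b) for b in positions] for a in positions]
reps, cost = select_representatives(dist, k=2)
print(reps, cost)
```

On this toy data the procedure keeps the central speaker of each group. With real recordings, `dist` would be filled by whichever speaker similarity model is under study, and the same cost can be used to compare selections obtained by different methods.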
This paper presents the results of the NEOLOGOS project: a children database and an optimized adult ...
This paper presents an effective technique for clustering speech utterances based on their associate...
In the context of the Neologos French speech database creation project, we hav...
In the context of the Neologos French speech database creation project, a general methodology was de...
In this work, I investigated structured approaches to data selection for speaker recognition, with a...
This paper investigates the problem of automatically grouping unknown speech utterances based on the...
Large multi-speaker datasets for TTS typically contain diverse speakers, recording conditions, style...
Speaker clustering refers to a process of classifying a set of input speech data (or spe...
The goal of speaker diarization is to determine where each participant speaks in a recordi...