The dataset compiled for our research on "Setting up a speech recognition model for under-resourced languages" consists of audio recordings of Dioula speakers pronouncing the numbers 1, 2, 3, and 4. The recordings were collected under varied conditions and differ in speaker, accent, and recording environment. The data is organized into four classes, one per number, to support training and evaluating a machine-learning-based speech recognition model.
DVoice is a community initiative that aims to provide African languages and dialects with data and m...
This is a speech dataset for the Sudanese dialect. The data has been collected from YouTube videos represent th...
In this work, I investigated structured approaches to data selection for speaker recognition, with a...
© 2017. The Author(s). For purposes of automated speech recognition in under-resourced environments,...
One particular problem in large vocabulary continuous speech recognition for low-resourced languages...
Most speech and language technologies are trained with massive amounts of spee...
The dataset was created to enable research on automatic speech recognition in Boulé (Baule) language...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Many of the language identification (LID) systems are based on language models using machine learnin...
Language identification is an important issue in many speech applications. We address this problem ...
For many of the 700 million illiterate people around the world, speech recognition technology could ...
Abstract: This paper addresses the problem of speech recognition to identify various modes of speech...
Prior speech and linguistics research has focused on the use of phoneme recognition in speech, and ...
Vocal languages across the world are estimated to be approximately 6000, yet only a handful of them ...
Automatic speech processing technologies hold great potential to facilitate th...