International audienceTraining a speech recognition system needs audio data and their corresponding exact transcriptions. However, manual transcribing is expensive, labor intensive and error-prone. Some sources, such as TV broadcast, have subtitles. Subtitles are closed to the exact transcription, but not exactly the same. Some sentences might be paraphrased, deleted, changed in word order, etc. Building automatic speech recognition from inexact subtitles may result in a poor models and low performance system. Therefore, selecting data is crucial to obtain a highly performance models. In this work, we explore the lightly supervised approach, which is a process to select a good acoustic data to train Deep Neural Network acoustic models. We s...
International audienceMost state-of-the-art speech systems use deep neural networks (DNNs). These sy...
In this thesis, we focus on performance prediction of automatic speech recognition (ASR) systems.Th...
Automatic speech recognition (ASR) technology has matured over the past few decades and has made sig...
This paper compares schemes for the selection of multi-genre broadcast data and corresponding transc...
International audienceThe language model is an important module in many applications that produce na...
Manual transcription of audio databases for the development of automatic speech recognition (ASR) sy...
Manual transcription of audio databases for automatic speech recognition (ASR) training is a costly ...
Manual transcription of audio databases for automatic speech recognition (ASR) training is a costly ...
Automatic speech recognition (ASR) requires a strong language model to guide the acoustic model and ...
Several high-resource Text to Speech (TTS) systems currently produce natural, well-established human...
Self-supervised learning (SSL) has been able to leverage unlabeled data to boost the performance of ...
There is growing recognition of the importance of data-centric methods for building machine learning...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Obtaining sufficient labelled training data is a persistent dif-ficulty for speech recognition resea...
The application of deep neural networks to the task of acoustic modeling for automatic speech recogn...
International audienceMost state-of-the-art speech systems use deep neural networks (DNNs). These sy...
In this thesis, we focus on performance prediction of automatic speech recognition (ASR) systems.Th...
Automatic speech recognition (ASR) technology has matured over the past few decades and has made sig...
This paper compares schemes for the selection of multi-genre broadcast data and corresponding transc...
International audienceThe language model is an important module in many applications that produce na...
Manual transcription of audio databases for the development of automatic speech recognition (ASR) sy...
Manual transcription of audio databases for automatic speech recognition (ASR) training is a costly ...
Manual transcription of audio databases for automatic speech recognition (ASR) training is a costly ...
Automatic speech recognition (ASR) requires a strong language model to guide the acoustic model and ...
Several high-resource Text to Speech (TTS) systems currently produce natural, well-established human...
Self-supervised learning (SSL) has been able to leverage unlabeled data to boost the performance of ...
There is growing recognition of the importance of data-centric methods for building machine learning...
The development of a speech recognition system requires at least three resources: a large labeled sp...
Obtaining sufficient labelled training data is a persistent dif-ficulty for speech recognition resea...
The application of deep neural networks to the task of acoustic modeling for automatic speech recogn...
International audienceMost state-of-the-art speech systems use deep neural networks (DNNs). These sy...
In this thesis, we focus on performance prediction of automatic speech recognition (ASR) systems.Th...
Automatic speech recognition (ASR) technology has matured over the past few decades and has made sig...