Accented speech that is under-represented in the training data still suffers high Word Error Rate (WER) with state-of-the-art Automatic Speech Recognition (ASR) systems. Careful collection and tran-scription of training data for different accents can address this issue, but it is both time consuming and expensive. However, for many tasks such as broadcast news or voice search, it is easy to obtain large amounts of audio data from target users with representative accents, albeit without accent labels or even transcriptions. Semi-supervised training have been explored for ASR in the past to leverage such data, but many of these techniques assume homogeneous training and test conditions. In this paper, we experiment with cross-entropy based sp...
Several adaptation approaches have been proposed in an eort to improve the speech recognition perfor...
This paper investigates the potential of improving a hybrid automatic speech recognition model train...
State-of-the-art automatic speech recognition (ASR) systems use sequence-level objectives like Conne...
Accented speech that is under-represented in the training data still suffers high Word Error Rate (W...
<p>Accented speech that is under-represented in the training data still suffers high Word Error Rate...
We experiment with active learning for speech recognition in the context of accent adaptation. We ad...
<p>We experiment with active learning for speech recognition in the context of accent adaptation. We...
Automatic speech recognition (ASR) systems have seen substantial improvements in the past decade; ho...
International audienceCurrent automatic speech recognition (ASR) systems trained on native speech of...
State-of-the-art Automatic Speech Recognition (ASR) models struggle to handle accented speech, parti...
Automatic Speech Recognition (ASR) systems have seen substantial improvements in the past decade; ho...
Manual transcription of audio databases for automatic speech recognition (ASR) training is a costly ...
This paper is concerned with automatic speech recognition (ASR) for accented speech. Given a small a...
The importance of Automatic Speech Recognition (ASR) Systems, whose job is to generate text from aud...
Automatic speech recognition (ASR) technology has matured over the past few decades and has made sig...
Several adaptation approaches have been proposed in an eort to improve the speech recognition perfor...
This paper investigates the potential of improving a hybrid automatic speech recognition model train...
State-of-the-art automatic speech recognition (ASR) systems use sequence-level objectives like Conne...
Accented speech that is under-represented in the training data still suffers high Word Error Rate (W...
<p>Accented speech that is under-represented in the training data still suffers high Word Error Rate...
We experiment with active learning for speech recognition in the context of accent adaptation. We ad...
<p>We experiment with active learning for speech recognition in the context of accent adaptation. We...
Automatic speech recognition (ASR) systems have seen substantial improvements in the past decade; ho...
International audienceCurrent automatic speech recognition (ASR) systems trained on native speech of...
State-of-the-art Automatic Speech Recognition (ASR) models struggle to handle accented speech, parti...
Automatic Speech Recognition (ASR) systems have seen substantial improvements in the past decade; ho...
Manual transcription of audio databases for automatic speech recognition (ASR) training is a costly ...
This paper is concerned with automatic speech recognition (ASR) for accented speech. Given a small a...
The importance of Automatic Speech Recognition (ASR) Systems, whose job is to generate text from aud...
Automatic speech recognition (ASR) technology has matured over the past few decades and has made sig...
Several adaptation approaches have been proposed in an eort to improve the speech recognition perfor...
This paper investigates the potential of improving a hybrid automatic speech recognition model train...
State-of-the-art automatic speech recognition (ASR) systems use sequence-level objectives like Conne...