ASR Error Detection (AED) models aim to post-process the output of Automatic Speech Recognition (ASR) systems, in order to detect transcription errors. Modern approaches usually use text-based input, comprised solely of the ASR transcription hypothesis, disregarding additional signals from the ASR model. Instead, we propose to utilize the ASR system's word-level confidence scores for improving AED performance. Specifically, we add an ASR Confidence Embedding (ACE) layer to the AED model's encoder, allowing us to jointly encode the confidence scores and the transcribed text into a contextualized representation. Our experiments show the benefits of ASR confidence scores for AED, their complementary effect over the textual signal, as well as t...
International audienceRetrieving the syntactic structure of erroneous ASR transcriptions can be of g...
Error propagation from automatic speech recognition (ASR) to machine translation (MT) is a critical ...
Despite improved performances of the latest Automatic Speech Recognition (ASR) systems, transcriptio...
High quality transcription data is crucial for training automatic speech recognition (ASR) systems. ...
Over the last years, many advances have been made in the field of Automatic Speech Recognition (ASR)...
Over the last years, many advances have been made in the field of Automatic Speech Recognition (ASR)...
Evaluation of automatic speech recognition (ASR) systems is difficult and costly, since it requires ...
International audienceThis paper addresses the problem of automatic speech recognition (ASR) error d...
Retrieving the syntactic structure of erroneous ASR transcriptions can be of great interest for open...
International audienceEvaluating transcriptions from automatic speech recognition (ASR) systems is a...
International audienceThe standard metric to evaluate automatic speech recognition (ASR) systems is ...
International audienceThe standard metric to evaluate automatic speech recognition (ASR) systems is ...
International audienceRetrieving the syntactic structure of erroneous ASR transcriptions can be of g...
We address the problem of estimating the quality of Automatic Speech Recognition (ASR) out-put at ut...
International audienceRetrieving the syntactic structure of erroneous ASR transcriptions can be of g...
International audienceRetrieving the syntactic structure of erroneous ASR transcriptions can be of g...
Error propagation from automatic speech recognition (ASR) to machine translation (MT) is a critical ...
Despite improved performances of the latest Automatic Speech Recognition (ASR) systems, transcriptio...
High quality transcription data is crucial for training automatic speech recognition (ASR) systems. ...
Over the last years, many advances have been made in the field of Automatic Speech Recognition (ASR)...
Over the last years, many advances have been made in the field of Automatic Speech Recognition (ASR)...
Evaluation of automatic speech recognition (ASR) systems is difficult and costly, since it requires ...
International audienceThis paper addresses the problem of automatic speech recognition (ASR) error d...
Retrieving the syntactic structure of erroneous ASR transcriptions can be of great interest for open...
International audienceEvaluating transcriptions from automatic speech recognition (ASR) systems is a...
International audienceThe standard metric to evaluate automatic speech recognition (ASR) systems is ...
International audienceThe standard metric to evaluate automatic speech recognition (ASR) systems is ...
International audienceRetrieving the syntactic structure of erroneous ASR transcriptions can be of g...
We address the problem of estimating the quality of Automatic Speech Recognition (ASR) out-put at ut...
International audienceRetrieving the syntactic structure of erroneous ASR transcriptions can be of g...
International audienceRetrieving the syntactic structure of erroneous ASR transcriptions can be of g...
Error propagation from automatic speech recognition (ASR) to machine translation (MT) is a critical ...
Despite improved performances of the latest Automatic Speech Recognition (ASR) systems, transcriptio...