We address the problem of localized error detection in Automatic Speech Recognition (ASR) output to support the generation of tar-geted clarifications in spoken dialogue sys-tems. Localized error detection finds specific mis-recognized words in a user utterance. Tar-geted clarifications, in contrast with generic ‘please repeat/rephrase ’ clarifications, target a specific mis-recognized word in an utter-ance (Stoyanchev et al., 2012a) and require accurate detection of such words. We extend and modify work presented in (Stoyanchev et al., 2012b) by experimenting with a new set of features for predicting the likelihood of a local error in an ASR hypothesis on an un-sifted version of the original dataset. We im-prove over baseline results, wher...
The present paper evaluates the role selected features and feature combinations play for error detec...
Speech recognition enables users to interact with devices via their voice. However, errors in speech...
Error propagation from automatic speech recognition (ASR) to machine translation (MT) is a critical ...
Thesis (Ph.D.)--University of Washington, 2015Automatic speech recognition (ASR), the transcription ...
Thesis (Ph.D.)--University of Washington, 2021Considering the complexity of speech communicatio...
Over the last years, many advances have been made in the field of Automatic Speech Recognition (ASR)...
Error correction in automatic speech recognition (ASR) aims to correct those incorrect words in sent...
After several decades of effort, speech recognition technologies have made significant progress and ...
International audienceIt is well-known that human listeners significantly outperform machines when i...
This paper proposes two methods for identifying recognition error. The first method is a two-level s...
Automatic speech recognition (ASR) of non-native utterances with grammatical errors is problematic. ...
Abstract With the exponential growth in computing power and progress in speech recognition technolog...
Given the state of the art of current speech technology, errors are unavoidable in present spoken di...
An End-Of-Turn Detection Module (EOTD-M) is an essential component of automatic Spoken Dialogue Syst...
Browsing through large volumes of spoken audio is known to be a challenging task for end users. One ...
The present paper evaluates the role selected features and feature combinations play for error detec...
Speech recognition enables users to interact with devices via their voice. However, errors in speech...
Error propagation from automatic speech recognition (ASR) to machine translation (MT) is a critical ...
Thesis (Ph.D.)--University of Washington, 2015Automatic speech recognition (ASR), the transcription ...
Thesis (Ph.D.)--University of Washington, 2021Considering the complexity of speech communicatio...
Over the last years, many advances have been made in the field of Automatic Speech Recognition (ASR)...
Error correction in automatic speech recognition (ASR) aims to correct those incorrect words in sent...
After several decades of effort, speech recognition technologies have made significant progress and ...
International audienceIt is well-known that human listeners significantly outperform machines when i...
This paper proposes two methods for identifying recognition error. The first method is a two-level s...
Automatic speech recognition (ASR) of non-native utterances with grammatical errors is problematic. ...
Abstract With the exponential growth in computing power and progress in speech recognition technolog...
Given the state of the art of current speech technology, errors are unavoidable in present spoken di...
An End-Of-Turn Detection Module (EOTD-M) is an essential component of automatic Spoken Dialogue Syst...
Browsing through large volumes of spoken audio is known to be a challenging task for end users. One ...
The present paper evaluates the role selected features and feature combinations play for error detec...
Speech recognition enables users to interact with devices via their voice. However, errors in speech...
Error propagation from automatic speech recognition (ASR) to machine translation (MT) is a critical ...