Given the state of the art of current speech technology, errors are unavoidable in present spoken dialogue systems. Therefore, one of the main concerns in dialogue design is how to decide whether or not the system has understood the user correctly. We identify methods to distinguish between correctly and incorrectly recognized utterances, scored by hand for semantic concept accuracy, using acoustic/prosodic characteristics. The analysis was performed on data collected during independent experiments done with an interactive voice response system that provides travel information over the phone
In this work, we describe how prosodic information can be employed to improve the performance of an ...
The present paper evaluates the role selected features and feature combinations play for error detec...
We present an analysis of several pub-licly available automatic speech recogniz-ers (ASRs) in terms ...
Given the state of the art of current speech technology, errors are unavoidable in present spoken di...
International audienceIt is well-known that human listeners significantly outperform machines when i...
Thesis (Ph.D.)--University of Washington, 2021Considering the complexity of speech communicatio...
Given the state of the art of current language and speech technology, errors are unavoidable in pres...
International audienceThis study explores automatic speech recognition (ASR) errors from a syntax-pr...
Many application environments have already usedspeech interface. But the low speech recognition rate...
Word error rate (WER), which is the most commonly used method of measuring automatic speech recognit...
An End-Of-Turn Detection Module (EOTD-M) is an essential component of automatic Spoken Dialogue Syst...
It is well-known that human listeners significantly outperform machines when it comes to transcribin...
We address the problem of localized error detection in Automatic Speech Recognition (ASR) output to ...
In this work, we describe how prosodic information can be employed to improve the performance of an ...
In free speaking tests candidates respond in spontaneous speech to prompts. This form of test allows...
In this work, we describe how prosodic information can be employed to improve the performance of an ...
The present paper evaluates the role selected features and feature combinations play for error detec...
We present an analysis of several pub-licly available automatic speech recogniz-ers (ASRs) in terms ...
Given the state of the art of current speech technology, errors are unavoidable in present spoken di...
International audienceIt is well-known that human listeners significantly outperform machines when i...
Thesis (Ph.D.)--University of Washington, 2021Considering the complexity of speech communicatio...
Given the state of the art of current language and speech technology, errors are unavoidable in pres...
International audienceThis study explores automatic speech recognition (ASR) errors from a syntax-pr...
Many application environments have already usedspeech interface. But the low speech recognition rate...
Word error rate (WER), which is the most commonly used method of measuring automatic speech recognit...
An End-Of-Turn Detection Module (EOTD-M) is an essential component of automatic Spoken Dialogue Syst...
It is well-known that human listeners significantly outperform machines when it comes to transcribin...
We address the problem of localized error detection in Automatic Speech Recognition (ASR) output to ...
In this work, we describe how prosodic information can be employed to improve the performance of an ...
In free speaking tests candidates respond in spontaneous speech to prompts. This form of test allows...
In this work, we describe how prosodic information can be employed to improve the performance of an ...
The present paper evaluates the role selected features and feature combinations play for error detec...
We present an analysis of several pub-licly available automatic speech recogniz-ers (ASRs) in terms ...