This doctoral thesis is the result of a research effort performed in two fields of speech technology, i.e., speech recognition and mispronunciation detection. Although the two areas are clearly distinguishable, the proposed approaches share a common hypothesis based on psychoacoustic processing of speech signals. The conjecture implies that the human auditory periphery provides a relatively good separation of different sound classes. Hence, it is possible to use recent findings from psychoacoustic perception together with mathematical and computational tools to model the auditory sensitivities to small speech signal changes. The performance of an automatic speech recognition system strongly depends on the representation used for the front-e...
Recently, Li et al. proposed a new auditory feature for robust speech recognition in noise environme...
This thesis presents methods and results for optimizing subword detectors in continuous speech. Spee...
State-of-the-art automatic speech recognition (ASR) systems are significantly inferior to humans esp...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
The areas of mispronunciation detection (or accent detection more specifically) within the speec...
A regular automatic speech recognizer works with a so-called recognition lexicon. This lexicon conta...
The areas of mispronunciation detection (or accent detection more specifically) within the speec...
The authors address the question whether phonological features can be used effectively in an automat...
Abstract Computer-aided pronunciation training (CAPT) technologies enable the use of automatic speec...
Automatic Speech Recognition (ASR) can be very useful in language learning tools in order to correct...
This thesis presents a detailed study on psychoacoustic modeling for feature extraction for robust s...
Recently, Li et al. proposed a new auditory feature for robust speech recognition in noise environme...
In this paper, we investigate the problem of mispronuncia-tion detection by considering the influenc...
A new approach for speech feature extraction in automatic speech recognition (ASR) is proposed in th...
Motivation for Speaker recognition work is presented in the first part of the thesis. An exhaustive ...
Recently, Li et al. proposed a new auditory feature for robust speech recognition in noise environme...
This thesis presents methods and results for optimizing subword detectors in continuous speech. Spee...
State-of-the-art automatic speech recognition (ASR) systems are significantly inferior to humans esp...
The performance of an automatic speech recognition (ASR) system strongly depends on the representati...
The areas of mispronunciation detection (or accent detection more specifically) within the speec...
A regular automatic speech recognizer works with a so-called recognition lexicon. This lexicon conta...
The areas of mispronunciation detection (or accent detection more specifically) within the speec...
The authors address the question whether phonological features can be used effectively in an automat...
Abstract Computer-aided pronunciation training (CAPT) technologies enable the use of automatic speec...
Automatic Speech Recognition (ASR) can be very useful in language learning tools in order to correct...
This thesis presents a detailed study on psychoacoustic modeling for feature extraction for robust s...
Recently, Li et al. proposed a new auditory feature for robust speech recognition in noise environme...
In this paper, we investigate the problem of mispronuncia-tion detection by considering the influenc...
A new approach for speech feature extraction in automatic speech recognition (ASR) is proposed in th...
Motivation for Speaker recognition work is presented in the first part of the thesis. An exhaustive ...
Recently, Li et al. proposed a new auditory feature for robust speech recognition in noise environme...
This thesis presents methods and results for optimizing subword detectors in continuous speech. Spee...
State-of-the-art automatic speech recognition (ASR) systems are significantly inferior to humans esp...