In this paper, we motivate the introduction of multiple feature streams to cover the gap between the noise-free and the estimated features in the context of Model-Based Feature Enhancement (MBFE) for noise robust speech recognition. Especially at low local SNR-levels the global MMSE-estimate might not be optimal and its uncertainty is large. Therefore, it is first shown how a constrained quadratic optimisation problem can improve the linear combination weights in the MMSE-formula. Alternatively, these weights are then approximated by K Kronecker deltas. Both approaches are compared by recognition experiments on the Aurora2 task. Also, Multiple Stream MBFE is validated on the large vocabulary Aurora4 benchmark task. On the latter, a decrease...
It is well known that additive noise can cause a significant decrease in performance for an automati...
Missing Data Theory (MDT) has shown to improve the robustness of automatic speech recognition (ASR) ...
In most speech recognition systems, acoustic features are extracted from the whole frequency spectru...
Maintaining a high level of robustness for Automatic Speech Recognition (ASR) systems is especially ...
Despite sophisticated present day automatic speech recognition (ASR) techniques, a single recognizer...
In this paper we focus on the challenging task of noise robustness for large vocabulary Continuous S...
This paper presents a multiple-model framework for noise-robust speech recognition. In this framewor...
In this paper we describe how we successfully extended the Model-Based Feature Enhancement (MBFE)-al...
This paper presents a novel feature extraction scheme tak-ing advantage of both the nonlinear modula...
Model-based techniques for robust speech recognition often require the statistics of noisy speech. I...
[[abstract]]In this paper, we present two novel algorithms to improve the noise robustness of featur...
In this thesis, the framework of multi-stream combination has been explored to improve the noise rob...
This paper investigates the use of heterogeneous features within a multi-stream framework, focusing ...
Automatic speech recognition (ASR) decodes speech signals into text. While ASR can produce accurate ...
We present a non-linear feature-domain noise reduction algorithm based on the minimum mean square er...
It is well known that additive noise can cause a significant decrease in performance for an automati...
Missing Data Theory (MDT) has shown to improve the robustness of automatic speech recognition (ASR) ...
In most speech recognition systems, acoustic features are extracted from the whole frequency spectru...
Maintaining a high level of robustness for Automatic Speech Recognition (ASR) systems is especially ...
Despite sophisticated present day automatic speech recognition (ASR) techniques, a single recognizer...
In this paper we focus on the challenging task of noise robustness for large vocabulary Continuous S...
This paper presents a multiple-model framework for noise-robust speech recognition. In this framewor...
In this paper we describe how we successfully extended the Model-Based Feature Enhancement (MBFE)-al...
This paper presents a novel feature extraction scheme tak-ing advantage of both the nonlinear modula...
Model-based techniques for robust speech recognition often require the statistics of noisy speech. I...
[[abstract]]In this paper, we present two novel algorithms to improve the noise robustness of featur...
In this thesis, the framework of multi-stream combination has been explored to improve the noise rob...
This paper investigates the use of heterogeneous features within a multi-stream framework, focusing ...
Automatic speech recognition (ASR) decodes speech signals into text. While ASR can produce accurate ...
We present a non-linear feature-domain noise reduction algorithm based on the minimum mean square er...
It is well known that additive noise can cause a significant decrease in performance for an automati...
Missing Data Theory (MDT) has shown to improve the robustness of automatic speech recognition (ASR) ...
In most speech recognition systems, acoustic features are extracted from the whole frequency spectru...