We study a category of robust speech recognition problem in which mismatches exist between training and testing conditions, and no accurate knowledge of the mismatch mechanism is available. The only available information is the test data along with a set of pretrained Gaussian mixture continuous density hidden Markov models (CDHMMs). We investigate the problem from the viewpoint of Bayesian prediction. A simple prior distribution, namely constrained uniform distribution, is adopted to characterize the uncertainty of the mean vectors of the CDHMMs. Two methods, namely a model compensation technique based on Bayesian predictive density and a robust decision strategy called Viterbi Bayesian predictive classification are studied. The proposed m...
This paper addresses the problem of robust speech recognition in noisy conditions in the framework o...
Models dealing directly with the raw acoustic speech signal are an alternative to conventional featu...
Speech processing applications such as speech enhancement and speaker identification rely on the est...
Abstract—In this paper, we study a category of robust speech recognition problem in which mismatches...
recognition problem in which mismatches exist between training and testing conditions, and no accura...
We introduce a new decision strategy called Bayesian predictive classification (BPC) for robust spee...
We extend our previously proposed Viterbi Bayesian predictive classification (VBPC) algorithm to acc...
In this paper, we extend our previously proposed Viterbi Bayesian predictive classification (VBPC) a...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
We previously introduced a new Bayesian predictive classi-fication (BPC) approach to robust speech r...
This article provides a unifying Bayesian view on various approaches for acoustic model adaptation, ...
Abstract—In this paper, we propose a robust compensation strategy to deal effectively with extraneou...
Revised version including a bugfix in the computation of the Wiener uncertainty estimator and in the...
We consider the problem of Gaussian mixture model (GMM)-based classification of noisy data, where th...
Many techniques in speech processing require inference based on observations that are of- ten noisy,...
This paper addresses the problem of robust speech recognition in noisy conditions in the framework o...
Models dealing directly with the raw acoustic speech signal are an alternative to conventional featu...
Speech processing applications such as speech enhancement and speaker identification rely on the est...
Abstract—In this paper, we study a category of robust speech recognition problem in which mismatches...
recognition problem in which mismatches exist between training and testing conditions, and no accura...
We introduce a new decision strategy called Bayesian predictive classification (BPC) for robust spee...
We extend our previously proposed Viterbi Bayesian predictive classification (VBPC) algorithm to acc...
In this paper, we extend our previously proposed Viterbi Bayesian predictive classification (VBPC) a...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
We previously introduced a new Bayesian predictive classi-fication (BPC) approach to robust speech r...
This article provides a unifying Bayesian view on various approaches for acoustic model adaptation, ...
Abstract—In this paper, we propose a robust compensation strategy to deal effectively with extraneou...
Revised version including a bugfix in the computation of the Wiener uncertainty estimator and in the...
We consider the problem of Gaussian mixture model (GMM)-based classification of noisy data, where th...
Many techniques in speech processing require inference based on observations that are of- ten noisy,...
This paper addresses the problem of robust speech recognition in noisy conditions in the framework o...
Models dealing directly with the raw acoustic speech signal are an alternative to conventional featu...
Speech processing applications such as speech enhancement and speaker identification rely on the est...