Recently there has been interest in structured discriminative models for speech recognition. In these models sentence posteriors are directly modelled, given a set of features extracted from the observation sequence, and hypothesised word sequence. In previous work these discriminative models have been combined with features derived from generative models for noise-robust speech recognition for continuous digits. This paper extends this work to medium to large vocabulary tasks. The form of the score-space extracted using the generative models, and parameter tying of the discriminative model, are both discussed. Update formulae for both conditional maximum likelihood and minimum Bayes' risk training are described. Experimental results are pr...
In this paper we propose discriminative training of hierarchical acoustic models for large vocabular...
We present a proposal of a kernel-based model for large vocabulary continuous speech recognizer. The...
Maintaining a high level of robustness for Automatic Speech Recognition (ASR) systems is especially ...
Recently there has been interest in structured discriminative models for speech recognition. In thes...
Generative models, normally in the form of hidden Markov models, have been the dominant form of acou...
Generative models, normally in the form of hidden Markov models, have been the dominant form of acou...
Automatic speech recognition (ASR) systems classify structured sequence data, where the label sequen...
In this paper we focus on the challenging task of noise robustness for large vocabulary Continuous S...
This paper describes a structured SVM framework suitable for noise-robust medium/large vocabulary sp...
This is the third and final progress report for epsrc Project ep/i006583/1 (Generative Kernels and S...
In this work, a framework for efficient discriminative training and modeling is developed and implem...
We present a new method for speech denoising and robust speech recognition. Using the framework of p...
Speech dynamic features, which provide smoothed estimates of the derivatives of the spectral paramet...
Speech dynamic features, which provide smoothed estimates of the derivatives of the spectral paramet...
Abstract-Speech recognition is formulated as a problem of maximum likelihood decoding. This formulat...
In this paper we propose discriminative training of hierarchical acoustic models for large vocabular...
We present a proposal of a kernel-based model for large vocabulary continuous speech recognizer. The...
Maintaining a high level of robustness for Automatic Speech Recognition (ASR) systems is especially ...
Recently there has been interest in structured discriminative models for speech recognition. In thes...
Generative models, normally in the form of hidden Markov models, have been the dominant form of acou...
Generative models, normally in the form of hidden Markov models, have been the dominant form of acou...
Automatic speech recognition (ASR) systems classify structured sequence data, where the label sequen...
In this paper we focus on the challenging task of noise robustness for large vocabulary Continuous S...
This paper describes a structured SVM framework suitable for noise-robust medium/large vocabulary sp...
This is the third and final progress report for epsrc Project ep/i006583/1 (Generative Kernels and S...
In this work, a framework for efficient discriminative training and modeling is developed and implem...
We present a new method for speech denoising and robust speech recognition. Using the framework of p...
Speech dynamic features, which provide smoothed estimates of the derivatives of the spectral paramet...
Speech dynamic features, which provide smoothed estimates of the derivatives of the spectral paramet...
Abstract-Speech recognition is formulated as a problem of maximum likelihood decoding. This formulat...
In this paper we propose discriminative training of hierarchical acoustic models for large vocabular...
We present a proposal of a kernel-based model for large vocabulary continuous speech recognizer. The...
Maintaining a high level of robustness for Automatic Speech Recognition (ASR) systems is especially ...