The majority of automatic speech recognition (ASR) systems rely on hidden Markov models (HMM), in which the output distribution associated with each state is modelled by a mixture of diagonal covariance Gaussians. Dynamic information is typically included by appending time-derivatives to feature vectors. This approach, whilst successful, makes the false assumption of framewise independence of the augmented feature vectors and ignores the spatial correlations in the parametrised speech signal. This dissertation seeks to address these shortcomings by exploring acoustic modelling for ASR with an application of a form of state-space model, the linear dynamic model (LDM). Rather than modelling individual frames of data, LDMs characterize entire ...
Abstract. A statistical generative model for the speech process is described that embeds a substanti...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
The task of a speech recogniser is to transcribe human speech into text. To do so, modern recogniser...
The majority of automatic speech recognition (ASR) systems rely on hidden Markov models (HMM), in wh...
The majority of automatic speech recognition (ASR) systems rely on hidden Markov models, in which Ga...
We propose that using a continuous trajectory model to describe an articulatory-based feature set wi...
In the past decades, statistics-based hidden Markov models (HMMs) have become the predominant approa...
Summarization: Although hidden Markov models (HMMs) provide a relatively efficient modeling framewor...
The standard hidden Markov model (HMM) has been proved to be the most successful model for speech re...
Declaration This dissertation is the result of my own work and includes nothing which is the outcome...
The general subject of this work is to present mathematical methods encountered in auto-matic speech...
This thesis describes work developing an approach to automatic speech recognition which incorporates...
Abstract—A new approach to represent temporal correlation in an automatic speech recognition system ...
The conditional independence assumption imposed by the hidden Markov models (HMMs) makes it difficul...
Conventional speech recognition systems consist of feature extraction, acoustic and language modelin...
Abstract. A statistical generative model for the speech process is described that embeds a substanti...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
The task of a speech recogniser is to transcribe human speech into text. To do so, modern recogniser...
The majority of automatic speech recognition (ASR) systems rely on hidden Markov models (HMM), in wh...
The majority of automatic speech recognition (ASR) systems rely on hidden Markov models, in which Ga...
We propose that using a continuous trajectory model to describe an articulatory-based feature set wi...
In the past decades, statistics-based hidden Markov models (HMMs) have become the predominant approa...
Summarization: Although hidden Markov models (HMMs) provide a relatively efficient modeling framewor...
The standard hidden Markov model (HMM) has been proved to be the most successful model for speech re...
Declaration This dissertation is the result of my own work and includes nothing which is the outcome...
The general subject of this work is to present mathematical methods encountered in auto-matic speech...
This thesis describes work developing an approach to automatic speech recognition which incorporates...
Abstract—A new approach to represent temporal correlation in an automatic speech recognition system ...
The conditional independence assumption imposed by the hidden Markov models (HMMs) makes it difficul...
Conventional speech recognition systems consist of feature extraction, acoustic and language modelin...
Abstract. A statistical generative model for the speech process is described that embeds a substanti...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
The task of a speech recogniser is to transcribe human speech into text. To do so, modern recogniser...