Abstract: Automatic speech recognition (ASR) systems usually consist of an acoustic model and a languagemodel. This paper describes a techniqueof an efficient deployment of t heacousticmodel parameters. The acoustic model typically utilizes Continuous Density Hi dden Markov Models (CDHMM). The output probability of a particular CDHMM state is represe nted by a Gaussian mixturedensitywithadiagonalcovariancestructure.Usually,theoutputprobabil itydensityfunction ofeachCDHMMstatecontainsthesamenumberofmixturecomponentsalthougha differentnumber of components in individual statesmay yieldmore accurate recognition resul ts, especially for low-resourceASRsystems.Thecentralideaistoassignmorecomponentsto stateswhereitiseffectiveand lesscomponentstost...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
Abstract-Speech recognition is formulated as a problem of maximum likelihood decoding. This formulat...
The hypothesis that for a given amount of training data a speaker model has an optimum number of com...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
This paper investigates the impact of subspace based techniques for acoustic modeling in automatic s...
Speech recognition applications are known to require a significant amount of resources. However, emb...
Dans cette thèse, nous proposons d’utiliser des techniques fondées sur l’analyse factorielle pour la...
In this thesis, we propose to use techniques based on factor analysis to build acoustic models for a...
In an HMM based large vocabulary continuous speech recognition system, the evaluation of - context d...
This thesis explores the use of efficient acoustic modeling techniques to improve the performance of...
Cette thèse se concentre sur la structuration du modèle acoustique pour améliorer la reconnaissance ...
The general subject of this work is to present mathematical methods encountered in auto-matic speech...
The majority of automatic speech recognition (ASR) systems rely on hidden Markov models (HMM), in wh...
Spoken words convey several levels of information. At the primary level, the speech conveys words or...
Automatic Speech Recognition (ASR) systems utilize statistical acoustic and language models to find ...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
Abstract-Speech recognition is formulated as a problem of maximum likelihood decoding. This formulat...
The hypothesis that for a given amount of training data a speaker model has an optimum number of com...
Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model...
This paper investigates the impact of subspace based techniques for acoustic modeling in automatic s...
Speech recognition applications are known to require a significant amount of resources. However, emb...
Dans cette thèse, nous proposons d’utiliser des techniques fondées sur l’analyse factorielle pour la...
In this thesis, we propose to use techniques based on factor analysis to build acoustic models for a...
In an HMM based large vocabulary continuous speech recognition system, the evaluation of - context d...
This thesis explores the use of efficient acoustic modeling techniques to improve the performance of...
Cette thèse se concentre sur la structuration du modèle acoustique pour améliorer la reconnaissance ...
The general subject of this work is to present mathematical methods encountered in auto-matic speech...
The majority of automatic speech recognition (ASR) systems rely on hidden Markov models (HMM), in wh...
Spoken words convey several levels of information. At the primary level, the speech conveys words or...
Automatic Speech Recognition (ASR) systems utilize statistical acoustic and language models to find ...
In conventional hidden Markov model (HMM) based speech recognisers, the emitting HMM states are mode...
Abstract-Speech recognition is formulated as a problem of maximum likelihood decoding. This formulat...
The hypothesis that for a given amount of training data a speaker model has an optimum number of com...