The accuracy of the acoustic models in large vocabulary recognition systems can be improved by increasing the resolution in the acoustic feature space. This can be obtained by increasing the number of gaussian densities in the models by splitting of the gaussians. This paper proposes a novel algorithm for this splitting operation. It is based on the phonetic decision tree used for the state tying in context dependent modelling. Advantage of the method is that it improves the capability of the acoustic models to discriminate between the different tied states. The proposed splitting algorithm was evaluated on the Wall Street Journal recognition task. Comparison with a commonly used splitting algorithm clearly shows that our method can provi...
The thesis deals with different aspects of automatic speech recognition. After an introduction, whic...
The aim of discriminant feature analysis techniques in the signal processing of speech recognition s...
The task of a speech recogniser is to transcribe human speech into text. To do so, modern recogniser...
In an HMM based large vocabulary continuous speech recognition system, the evaluation of - context d...
In this paper, a fast segmental clustering approach to decision tree tying based acoustic modeling i...
Discriminative training has been established as an effective technique for training the acoustic mod...
In this work, a framework for efficient discriminative training and modeling is developed and implem...
Discriminative model combination is a new approach in the field of automatic speech recognition, whi...
Discriminative training has become an important means for estimating model parameters in many statis...
We have recently proposed a new acoustic model based on prob-abilistic linear discriminant analysis ...
In this paper, a new decision tree-based clustering technique called Phonetic, Dimensional and State...
Senones were introduced to share Hidden Markov model (HMM) parameters at a sub-phonetic level in [3]...
Speech recognition systems typically contain many Gaussian distributions, and hence a large number o...
Ebru Arısoy (MEF Author)##nofulltext##This paper summarizes the research on discriminative language ...
This paper investigates two important issues in constructing and combining ensembles of acoustic mo...
The thesis deals with different aspects of automatic speech recognition. After an introduction, whic...
The aim of discriminant feature analysis techniques in the signal processing of speech recognition s...
The task of a speech recogniser is to transcribe human speech into text. To do so, modern recogniser...
In an HMM based large vocabulary continuous speech recognition system, the evaluation of - context d...
In this paper, a fast segmental clustering approach to decision tree tying based acoustic modeling i...
Discriminative training has been established as an effective technique for training the acoustic mod...
In this work, a framework for efficient discriminative training and modeling is developed and implem...
Discriminative model combination is a new approach in the field of automatic speech recognition, whi...
Discriminative training has become an important means for estimating model parameters in many statis...
We have recently proposed a new acoustic model based on prob-abilistic linear discriminant analysis ...
In this paper, a new decision tree-based clustering technique called Phonetic, Dimensional and State...
Senones were introduced to share Hidden Markov model (HMM) parameters at a sub-phonetic level in [3]...
Speech recognition systems typically contain many Gaussian distributions, and hence a large number o...
Ebru Arısoy (MEF Author)##nofulltext##This paper summarizes the research on discriminative language ...
This paper investigates two important issues in constructing and combining ensembles of acoustic mo...
The thesis deals with different aspects of automatic speech recognition. After an introduction, whic...
The aim of discriminant feature analysis techniques in the signal processing of speech recognition s...
The task of a speech recogniser is to transcribe human speech into text. To do so, modern recogniser...