Speech recognition systems typically use mixtures of diagonal Gaussians to model the acoustics. Using Gaussians with a more general covariance structure can give improved performance; EM-LLT [1] and SPAM [2] models give improvements by restricting the inverse covariance to a linear/affine subspace spanned by rank one and full rank matrices respectively. In this paper we consider training these subspaces to maximize likelihood. For EMLLT ML training the subspace results in significant gains over the scheme proposed in [1]. For SPAM ML training of the subspace slightly improves performance over the method reported in [2]. For the same subspace size an EMLLT model is more efficient computationally than a SPAM model, while the SPAM model is mor...
CLASSIFICATION Maximum Likelihood (ML) modeling of multiclass data for classi cation often su ers fr...
Automatic Speaker Verification (ASV) is a critical task in pattern recognition and has been applied ...
Recently various techniques to improve the correlation model of feature vector elements in speech re...
We propose to use sparse inverse covariance matrices for acoustic model training when there is insuf...
Gaussian distributions are usually parameterized with their natural parameters: the mean and the co...
This paper applies the recently proposed SPAM models for acoustic modeling in a Speaker Adaptive Tra...
An estimation of parameters of a multivariate Gaussian Mixture Model is usually based on a criterion...
We address the problem of learning the structure of Gaussian graphical models for use in automatic s...
Most HMM-based speech recognition systems use Gaussian mixtures as observation probability density f...
Adaptation using linear transforms is well known to significantly improve the performance of speech ...
We discuss the applicability of large margin techniques to the prob-lem of estimating linear transfo...
HMM-based systems for Automatic Speech Recognition typically model the acoustic features using mixt...
In this paper, we study acoustic modeling for speech recognition using mixtures of exponential model...
Discriminative training has become an important means for estimating model parameters in many statis...
Automatic Speaker Verification (ASV) is a critical task in pattern recognition and has been applied ...
CLASSIFICATION Maximum Likelihood (ML) modeling of multiclass data for classi cation often su ers fr...
Automatic Speaker Verification (ASV) is a critical task in pattern recognition and has been applied ...
Recently various techniques to improve the correlation model of feature vector elements in speech re...
We propose to use sparse inverse covariance matrices for acoustic model training when there is insuf...
Gaussian distributions are usually parameterized with their natural parameters: the mean and the co...
This paper applies the recently proposed SPAM models for acoustic modeling in a Speaker Adaptive Tra...
An estimation of parameters of a multivariate Gaussian Mixture Model is usually based on a criterion...
We address the problem of learning the structure of Gaussian graphical models for use in automatic s...
Most HMM-based speech recognition systems use Gaussian mixtures as observation probability density f...
Adaptation using linear transforms is well known to significantly improve the performance of speech ...
We discuss the applicability of large margin techniques to the prob-lem of estimating linear transfo...
HMM-based systems for Automatic Speech Recognition typically model the acoustic features using mixt...
In this paper, we study acoustic modeling for speech recognition using mixtures of exponential model...
Discriminative training has become an important means for estimating model parameters in many statis...
Automatic Speaker Verification (ASV) is a critical task in pattern recognition and has been applied ...
CLASSIFICATION Maximum Likelihood (ML) modeling of multiclass data for classi cation often su ers fr...
Automatic Speaker Verification (ASV) is a critical task in pattern recognition and has been applied ...
Recently various techniques to improve the correlation model of feature vector elements in speech re...