In this thesis, we study the problem of learning a linear transformation of acoustic feature vectors for speech recognition, in a framework where apart from the acoustics, additional views are available at training time. We consider a multiview learning approach based on canonical correlation analysis to learn linear transformations of the acoustic features that are maximally correlated with the data. We propose simple approaches for combining information shared across the views with information that is private to the acoustic view. We apply these methods to a specific scenario in which articulatory data is available at training time. Results of phonetic frame classification on data drawn from the University of Wisconsin X-ray Microbeam...
In multi-view learning, massive literature is devoted to exploring the intrinsic structure between c...
In most HMM-based recognition systems, a mixture of diagonal covariance gaussians is used to model t...
With the advancement of information technology, a large amount of data are generated from different ...
In this thesis, we study the problem of learning a linear transformation of acoustic feature vectors...
In this thesis, we study the problem of learning a linear transformation of acoustic feature vectors...
It has been previously shown that, when both acoustic and artic-ulatory training data are available,...
It has been previously shown that, when both acoustic and artic-ulatory training data are available,...
Previous work has shown that acoustic features can be im-proved by unsupervised learning of transfor...
[1] Bharadwaj, Arora, Livescu, and Hasegawa-Johnson. Multi-view acoustic feature learning using arti...
We consider learning representations (features) in the setting in which we have access to mul-tiple ...
Automatic speaker recognition in uncontrolled environments is a very challenging task due to channel...
Hotelling's Canonical Correlation Analysis (CCA) works with two sets of related variables, also call...
In the classification of non-stationary time series data such as sounds, it is often tedious and exp...
A probabilistic and statistical framework is presented for automatic speech recognition based on a p...
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Acknowledgments . . . . . ...
In multi-view learning, massive literature is devoted to exploring the intrinsic structure between c...
In most HMM-based recognition systems, a mixture of diagonal covariance gaussians is used to model t...
With the advancement of information technology, a large amount of data are generated from different ...
In this thesis, we study the problem of learning a linear transformation of acoustic feature vectors...
In this thesis, we study the problem of learning a linear transformation of acoustic feature vectors...
It has been previously shown that, when both acoustic and artic-ulatory training data are available,...
It has been previously shown that, when both acoustic and artic-ulatory training data are available,...
Previous work has shown that acoustic features can be im-proved by unsupervised learning of transfor...
[1] Bharadwaj, Arora, Livescu, and Hasegawa-Johnson. Multi-view acoustic feature learning using arti...
We consider learning representations (features) in the setting in which we have access to mul-tiple ...
Automatic speaker recognition in uncontrolled environments is a very challenging task due to channel...
Hotelling's Canonical Correlation Analysis (CCA) works with two sets of related variables, also call...
In the classification of non-stationary time series data such as sounds, it is often tedious and exp...
A probabilistic and statistical framework is presented for automatic speech recognition based on a p...
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Acknowledgments . . . . . ...
In multi-view learning, massive literature is devoted to exploring the intrinsic structure between c...
In most HMM-based recognition systems, a mixture of diagonal covariance gaussians is used to model t...
With the advancement of information technology, a large amount of data are generated from different ...