Non-negative spectral factorisation with long temporal context has been successfully used for noise robust recognition of speech in multi-source environments. Sparse classification from activations of speech atoms can be employed instead of conventional GMMs to determine speech state likelihoods. For accurate classification, cor-rect linguistic state labels must be assigned to speech atoms. We propose using non-negative matrix deconvolution for learning the labels with algorithms closely matching a framework that separates speech from additive noises. Experiments on the 1st CHiME Chal-lenge corpus show improvement in recognition accuracy over labels acquired from original atom sources or previously used least squares regression. The new app...
Discovering a representation that allows auditory data to be parsimoniously represented is useful fo...
Proceedings of: IberSPEECH 2012 Conference, Madrid, Spain, November 21-23, 2012.A speech denoising m...
Discovering a representation that allows auditory data to be parsimoniously represented is useful fo...
Recognition and classification of speech content in everyday environments is challenging due to the ...
Recognition and classification of speech content in everyday environments is challenging due to the ...
This paper proposes learning-based methods for mapping a sparse representation of noisy speech to st...
We introduce a framework for speech enhancement based on convolutive non-negative matrix factorizati...
The full modulation spectrum is a high-dimensional representation of one-dimensional audio signals. ...
Contains fulltext : 132233.pdf (publisher's version ) (Open Access)The full modula...
We propose a convolutive non-negative matrix factorization method to improve the intelligibility of ...
Speech recognition systems intended for everyday use must be able to cope with a large variety of no...
This paper proposes a speech recognition method for applications in adverse noisy environments. Spee...
We present a self-learning algorithm using a bottom-up based approach to automatically discover, acq...
The unsupervised learning of spectro-temporal patterns within speech signals is of interest in a bro...
Proceedings of: IberSPEECH 2012 Conference, Madrid, Spain, November 21-23, 2012.A speech denoising m...
Discovering a representation that allows auditory data to be parsimoniously represented is useful fo...
Proceedings of: IberSPEECH 2012 Conference, Madrid, Spain, November 21-23, 2012.A speech denoising m...
Discovering a representation that allows auditory data to be parsimoniously represented is useful fo...
Recognition and classification of speech content in everyday environments is challenging due to the ...
Recognition and classification of speech content in everyday environments is challenging due to the ...
This paper proposes learning-based methods for mapping a sparse representation of noisy speech to st...
We introduce a framework for speech enhancement based on convolutive non-negative matrix factorizati...
The full modulation spectrum is a high-dimensional representation of one-dimensional audio signals. ...
Contains fulltext : 132233.pdf (publisher's version ) (Open Access)The full modula...
We propose a convolutive non-negative matrix factorization method to improve the intelligibility of ...
Speech recognition systems intended for everyday use must be able to cope with a large variety of no...
This paper proposes a speech recognition method for applications in adverse noisy environments. Spee...
We present a self-learning algorithm using a bottom-up based approach to automatically discover, acq...
The unsupervised learning of spectro-temporal patterns within speech signals is of interest in a bro...
Proceedings of: IberSPEECH 2012 Conference, Madrid, Spain, November 21-23, 2012.A speech denoising m...
Discovering a representation that allows auditory data to be parsimoniously represented is useful fo...
Proceedings of: IberSPEECH 2012 Conference, Madrid, Spain, November 21-23, 2012.A speech denoising m...
Discovering a representation that allows auditory data to be parsimoniously represented is useful fo...