Speech and other natural sounds show high temporal correlation and smooth spectral evolution punctuated by a few, irregular and abrupt changes. In a conventional Hidden Markov Model (HMM), such structure is represented weakly and indirectly through transitions between explicit states representing 'steps' along such smooth changes. It would be more efficient and informative to model successive spectra as transformations of their immediate predecessors, and we present a model which focuses on local deformations of adjacent bins in a time-frequency surface to explain an observed sound, using explicit representation only for those bins that cannot be predicted from their context. We further decompose the log-spectrum into two additive layers, w...
Previously we presented an auditory-inspired feed-forward architecture which achieves good performan...
A novel method for speech recognition is presented, utilizing nonlinear/chaotic signal processing te...
Sound textures - for instance, a crackling fire, running water, or applause - constitute a large and...
Speaker models for blind source separation are typically based on HMMs consisting of vast numbers of...
This paper presents a novel acoustic modeling framework that naturally extends the Hidden Markov Mod...
Methods for blind source separation based only on general properties such as source independence enc...
Detailed hidden Markov models (HMMs) that capture the constraints implicit in a particular sound can...
This paper presents the theoretical basis and preliminary experimental results of a new HMM model, r...
HMM2 is a particular hidden Markov model where state emission probabilities of the temporal (primary...
Natural sounds are structured on many time-scales. A typical segment of speech, for example, contain...
Tracking vocal tract formant frequencies ($f_p$) and estimating the fundamental frequency ($f_0$) ar...
In hidden Markov model-based speech synthesis, speech is typically parameterized using source-filter...
We present a new probabilistic model for polyphonic audio termed Factorial Scaled Hidden Markov Mode...
State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov ...
This paper considers the problem of obtaining an accurate spectral representation of speech formant ...
Previously we presented an auditory-inspired feed-forward architecture which achieves good performan...
A novel method for speech recognition is presented, utilizing nonlinear/chaotic signal processing te...
Sound textures - for instance, a crackling fire, running water, or applause - constitute a large and...
Speaker models for blind source separation are typically based on HMMs consisting of vast numbers of...
This paper presents a novel acoustic modeling framework that naturally extends the Hidden Markov Mod...
Methods for blind source separation based only on general properties such as source independence enc...
Detailed hidden Markov models (HMMs) that capture the constraints implicit in a particular sound can...
This paper presents the theoretical basis and preliminary experimental results of a new HMM model, r...
HMM2 is a particular hidden Markov model where state emission probabilities of the temporal (primary...
Natural sounds are structured on many time-scales. A typical segment of speech, for example, contain...
Tracking vocal tract formant frequencies ($f_p$) and estimating the fundamental frequency ($f_0$) ar...
In hidden Markov model-based speech synthesis, speech is typically parameterized using source-filter...
We present a new probabilistic model for polyphonic audio termed Factorial Scaled Hidden Markov Mode...
State-of-the-art automatic speech recognition (ASR) techniques are typically based on hidden Markov ...
This paper considers the problem of obtaining an accurate spectral representation of speech formant ...
Previously we presented an auditory-inspired feed-forward architecture which achieves good performan...
A novel method for speech recognition is presented, utilizing nonlinear/chaotic signal processing te...
Sound textures - for instance, a crackling fire, running water, or applause - constitute a large and...