This work presents a method of reconstructing a speech signal from a stream of MFCC vectors using a source-filter model of speech production. The MFCC vectors are used to provide an estimate of the vocal tract filter. This is achieved by inverting the MFCC vector back to a smoothed estimate of the magnitude spectrum. The Wiener-Khintchine theorem and linear predictive analysis transform this into an estimate of the vocal tract filter coefficients. The excitation signal is produced from a series of pitch pulses or white noise, depending on whether the speech is voiced or unvoiced. This pitch estimate forms an extra element of the feature vector. Listening tests reveal that the reconstructed speech is intelligible and of similar quality to a ...
This thesis addresses the problem of quality degradation in speech produced by parameter-based speec...
This work proposes a method to predict the fundamental frequency and voicing of a frame of speech fr...
This work proposes a method to predict the fundamental frequency and voicing of a frame of speech fr...
This work presents a method of reconstructing a speech signal from a stream of MFCC vectors using a ...
This work proposes a method for predicting the fundamental frequency and voicing of a frame of speec...
The paper proposes a technique for reconstructing an acoustic speech signal solely from a stream of ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
The classical front end analysis in speech recognition is a spectral analysis which parametrizes the...
The aim of this work is to reconstruct clean speech solely from a stream of noise-contaminated MFCC ...
This work proposes a method to reconstruct an acoustic speech signal solely from a stream of mel-fre...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
Mel Frequency Cepstral Coefficients (MFCCs) are the most popularly used speech features in most spee...
This chapter is concerned with feature extraction and back-end speech reconstruction and is particul...
This paper proposes a method for generating speech from filterbank mel frequency cepstral coefficien...
This thesis addresses the problem of quality degradation in speech produced by parameter-based speec...
This work proposes a method to predict the fundamental frequency and voicing of a frame of speech fr...
This work proposes a method to predict the fundamental frequency and voicing of a frame of speech fr...
This work presents a method of reconstructing a speech signal from a stream of MFCC vectors using a ...
This work proposes a method for predicting the fundamental frequency and voicing of a frame of speec...
The paper proposes a technique for reconstructing an acoustic speech signal solely from a stream of ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
The classical front end analysis in speech recognition is a spectral analysis which parametrizes the...
The aim of this work is to reconstruct clean speech solely from a stream of noise-contaminated MFCC ...
This work proposes a method to reconstruct an acoustic speech signal solely from a stream of mel-fre...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
Mel Frequency Cepstral Coefficients (MFCCs) are the most popularly used speech features in most spee...
This chapter is concerned with feature extraction and back-end speech reconstruction and is particul...
This paper proposes a method for generating speech from filterbank mel frequency cepstral coefficien...
This thesis addresses the problem of quality degradation in speech produced by parameter-based speec...
This work proposes a method to predict the fundamental frequency and voicing of a frame of speech fr...
This work proposes a method to predict the fundamental frequency and voicing of a frame of speech fr...