This work compares the accuracy of fundamental frequency and formant frequency estimation methods and maximum a posteriori (MAP) prediction from MFCC vectors with hand-corrected references. Five fundamental frequency estimation methods are compared to fundamental frequency prediction from MFCC vectors in both clean and noisy speech. Similarly, three formant frequency estimation and prediction methods are compared. An analysis of estimation and prediction accuracy shows that prediction from MFCCs provides the most accurate voicing classification across clean and noisy speech. On clean speech, fundamental frequency estimation outperforms prediction from MFCCs, but as noise increases the performance of prediction is significantly more robust t...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
This paper describes how formant frequencies of voiced and unvoiced speech can be predicted from mel...
This work compares the accuracy of fundamental frequency and formant frequency estimation methods an...
This work proposes a method to reconstruct an acoustic speech signal solely from a stream of mel-fre...
This work proposes a method for predicting the fundamental frequency and voicing of a frame of speec...
This work proposes a method to predict the fundamental frequency and voicing of a frame of speech fr...
This work proposes a method to predict the fundamental frequency and voicing of a frame of speech fr...
Novel methods are presented for predicting formant frequencies and voicing class from mel-frequency ...
Novel methods are presented for predicting formant frequencies and voicing class from mel-frequency ...
This paper proposes a method of predicting the formant frequencies of a frame of speech from its mel...
This paper proposes a method of predicting the formant frequencies of a frame of speech from its mel...
The aim of this work is to reconstruct clean speech solely from a stream of noise-contaminated MFCC ...
This paper examines the effect of applying noise compensation to acoustic speech feature prediction ...
The paper proposes a technique for reconstructing an acoustic speech signal solely from a stream of ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
This paper describes how formant frequencies of voiced and unvoiced speech can be predicted from mel...
This work compares the accuracy of fundamental frequency and formant frequency estimation methods an...
This work proposes a method to reconstruct an acoustic speech signal solely from a stream of mel-fre...
This work proposes a method for predicting the fundamental frequency and voicing of a frame of speec...
This work proposes a method to predict the fundamental frequency and voicing of a frame of speech fr...
This work proposes a method to predict the fundamental frequency and voicing of a frame of speech fr...
Novel methods are presented for predicting formant frequencies and voicing class from mel-frequency ...
Novel methods are presented for predicting formant frequencies and voicing class from mel-frequency ...
This paper proposes a method of predicting the formant frequencies of a frame of speech from its mel...
This paper proposes a method of predicting the formant frequencies of a frame of speech from its mel...
The aim of this work is to reconstruct clean speech solely from a stream of noise-contaminated MFCC ...
This paper examines the effect of applying noise compensation to acoustic speech feature prediction ...
The paper proposes a technique for reconstructing an acoustic speech signal solely from a stream of ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
This paper describes how formant frequencies of voiced and unvoiced speech can be predicted from mel...