Novel speech features calculated from third-order statistics of subband-filtered speech signals are introduced and studied for robust speech recognition. These features have the potential to capture nonlinear information not represented by cepstral coefficients. Also, because the features presented in this paper are based on the third-order moments, they may be more immune to Gaussian noise than cepstrals, as Gaussian distributions have zero third-order moments. Experiments on the AURORA2 database studying these features in combination with Mel-frequency cepstral coefficients (MFCC’s) are presented, and some improvement over the MFCC-only baseline is shown when clean speech is used for training, though the same improvement is not seen when ...
Abstract- Cepstral coefficients derived either through linear prediction (LP) analysis or from filte...
This thesis focuses on the use of third-order statistics in adaptive blind deconvolution of asymmetr...
A novel method for speech recognition is presented, utilizing nonlinear/chaotic signal processing te...
Abstract—Cepstral normalization has widely been used as a powerful approach to produce robust featur...
This dissertation introduces a new approach to estimation of the features used in an automatic speec...
In this paper, a feature extraction algorithm for robust speech recognition is introduced. The featu...
This dissertation introduces a new approach to estimation of the features used in an automatic speec...
The use of third-order statistics to determine the pitch of a speech signal and how they can elimina...
The results of investigations into some aspects of robust speech recognition are reported in this th...
In state-of-the-art speaker recognition systems, mel-scaled frequency cepstral coefficients (MFCCs) ...
The goal of speech parameterization is to extract the relevant information about what is being spoke...
In this paper we present a new statistical model, which de-scribes the corruption to speech recognit...
Many speech recognition systems use mel-frequency cep-stral coefficient (mfcc) feature extraction as...
We develop noise robust features using Gammatone wavelets derived from the popular Gammatone functio...
Abstract: The goal of speech parameterization is to extract the relevant information about what is b...
Abstract- Cepstral coefficients derived either through linear prediction (LP) analysis or from filte...
This thesis focuses on the use of third-order statistics in adaptive blind deconvolution of asymmetr...
A novel method for speech recognition is presented, utilizing nonlinear/chaotic signal processing te...
Abstract—Cepstral normalization has widely been used as a powerful approach to produce robust featur...
This dissertation introduces a new approach to estimation of the features used in an automatic speec...
In this paper, a feature extraction algorithm for robust speech recognition is introduced. The featu...
This dissertation introduces a new approach to estimation of the features used in an automatic speec...
The use of third-order statistics to determine the pitch of a speech signal and how they can elimina...
The results of investigations into some aspects of robust speech recognition are reported in this th...
In state-of-the-art speaker recognition systems, mel-scaled frequency cepstral coefficients (MFCCs) ...
The goal of speech parameterization is to extract the relevant information about what is being spoke...
In this paper we present a new statistical model, which de-scribes the corruption to speech recognit...
Many speech recognition systems use mel-frequency cep-stral coefficient (mfcc) feature extraction as...
We develop noise robust features using Gammatone wavelets derived from the popular Gammatone functio...
Abstract: The goal of speech parameterization is to extract the relevant information about what is b...
Abstract- Cepstral coefficients derived either through linear prediction (LP) analysis or from filte...
This thesis focuses on the use of third-order statistics in adaptive blind deconvolution of asymmetr...
A novel method for speech recognition is presented, utilizing nonlinear/chaotic signal processing te...