This paper proposes an integrated speech front-end for both speech recognition and speech reconstruction applications. Speech is first decomposed into a set of frequency bands by an auditory model. The output of the auditory model is then used to extract both robust pitch estimates and MFCC vectors. Initial tests used a 128-channel auditory model, but results show that the number of channels can be reduced significantly, to between 23 and 32. A detailed analysis of the pitch classification accuracy and the RMS pitch error shows the system to be more robust than both comb-function and LPC-based pitch extraction. Speech recognition results show that the auditory-based cepstral coefficients give very similar performance to conventional MFCCs. Spectrograms and informa...
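As an illustration of two quantities named in the abstract, the following minimal Python sketch derives cepstral coefficients from a vector of auditory-model channel energies and computes an RMS pitch error. It is not the authors' implementation. The function names, the choice of 13 coefficients, the energy floor, the orthonormal DCT-II, and the convention that a pitch of 0 Hz marks an unvoiced frame are all assumptions made for the example.

```python
import math


def cepstra_from_channels(energies, n_ceps=13, floor=1e-10):
    """Log-compress channel energies and apply an orthonormal DCT-II.

    `energies` is one frame of filterbank (auditory-model) channel energies.
    `n_ceps` and `floor` are illustrative defaults, not values from the paper.
    """
    n = len(energies)
    # The floor avoids log(0) for silent channels.
    log_e = [math.log(max(e, floor)) for e in energies]
    ceps = []
    for k in range(n_ceps):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        total = sum(
            v * math.cos(math.pi * k * (i + 0.5) / n)
            for i, v in enumerate(log_e)
        )
        ceps.append(scale * total)
    return ceps


def rms_pitch_error(ref_hz, est_hz):
    """RMS pitch error over frames where both tracks are voiced.

    Assumption: a value of 0 Hz marks an unvoiced frame, and such frames
    are excluded from the error.
    """
    pairs = [(r, e) for r, e in zip(ref_hz, est_hz) if r > 0 and e > 0]
    if not pairs:
        return 0.0
    return math.sqrt(sum((r - e) ** 2 for r, e in pairs) / len(pairs))


# Usage: a hypothetical 23-channel frame with equal energy in every channel.
frame = [2.0] * 23
coeffs = cepstra_from_channels(frame)
error = rms_pitch_error([100.0, 0.0, 200.0], [110.0, 120.0, 190.0])
```

With a flat spectrum all the log energy falls into the zeroth coefficient, which is why the higher coefficients are often read as describing spectral shape rather than level. Excluding unvoiced frames from the RMS error keeps voicing mistakes separate from pitch inaccuracy, and those mistakes would instead be counted by a classification accuracy measure.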
In this article the authors normalize the speech signal based on the publicly available AN4 database...
In this paper a model-based approach for restoring a continuous fundamental frequency (F0) contour f...
Speech recognition has been an important and active area of analysis in recent years. This analysis aims ...
The paper proposes a technique for reconstructing an acoustic speech signal solely from a stream of ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
The aim of this work is to enable a noise-free time-domain speech signal to be reconstructed from a ...
This chapter is concerned with feature extraction and back-end speech reconstruction and is particul...
This work presents a method of reconstructing a speech signal from a stream of MFCC vectors using a ...
This paper presents a novel approach to the design of a robust speaker recognition system. A noise-f...
This paper presents a novel approach to the design of a robust speaker recognition system. A noise-f...
This work presents a method of reconstructing a speech signal from a stream of MFCC vectors using a ...
The aim of this work is to reconstruct clean speech solely from a stream of noise-contaminated MFCC ...
This paper presents a novel feature extraction method to improve the performance of speaker identifi...
This paper presents a novel feature extraction method to improve the performance of speaker identifi...