We have investigated the use of a pitch adaptive spectral representation on large vocabulary speech recognition, in conjunction with speaker normalisation techniques. We have compared the effect of a smoothed spectrogram to the pitch adaptive spectral analysis by decoupling these two components of STRAIGHT. Experiments performed on a large vocabulary meeting speech recognition task highlight the importance of combining a pitch adaptive spectral representation with a conventional fixed window spectral analysis. We found evidence that STRAIGHT pitch adaptive features are more speaker independent than conventional MFCCs without pitch adaptation, thus they also provide better performances when combined using feature combination techniques such ...
The changing on speech peaks structure is perhaps the most important cause of degradation of speech ...
This paper presents a novel approach to the design of a robust speaker recognition system. A noise-f...
The changing on speech peaks structure is perhaps the most important cause of degradation of speech ...
In this paper we investigate the combination of complementary acoustic feature streams in large voca...
One of the main problems faced by automatic speech recognition is the variability of the testing con...
One of the main problems faced by automatic speech recognition is the variability of the testing co...
In this article the authors normalize the speech signal based on the publicly available AN4 database...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
of the requirements for the Degree of Doctor of Philosophy There has been a substantial interest in ...
This paper presents a novel feature extraction method to improve the performance of speaker identifi...
This paper presents a novel feature extraction method to improve the performance of speaker identifi...
This paper presents a method for extracting MFCC parameters from a normalised power spectrum density...
In our earlier work [1], we employed MVDR (minimum variance distortionless response) based spectral ...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
In our earlier work [1], we employed MVDR (minimum variance distortionless response) based spectral ...
The changing on speech peaks structure is perhaps the most important cause of degradation of speech ...
This paper presents a novel approach to the design of a robust speaker recognition system. A noise-f...
The changing on speech peaks structure is perhaps the most important cause of degradation of speech ...
In this paper we investigate the combination of complementary acoustic feature streams in large voca...
One of the main problems faced by automatic speech recognition is the variability of the testing con...
One of the main problems faced by automatic speech recognition is the variability of the testing co...
In this article the authors normalize the speech signal based on the publicly available AN4 database...
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer S...
of the requirements for the Degree of Doctor of Philosophy There has been a substantial interest in ...
This paper presents a novel feature extraction method to improve the performance of speaker identifi...
This paper presents a novel feature extraction method to improve the performance of speaker identifi...
This paper presents a method for extracting MFCC parameters from a normalised power spectrum density...
In our earlier work [1], we employed MVDR (minimum variance distortionless response) based spectral ...
In this work, normalization techniques in the acoustic feature space are studied which improve the r...
In our earlier work [1], we employed MVDR (minimum variance distortionless response) based spectral ...
The changing on speech peaks structure is perhaps the most important cause of degradation of speech ...
This paper presents a novel approach to the design of a robust speaker recognition system. A noise-f...
The changing on speech peaks structure is perhaps the most important cause of degradation of speech ...