ISI publication article. This paper proposes a novel feature-space VTLN (vocal tract length normalization) method that models frequency warping as a linear interpolation of contiguous Mel filter-bank energies. The presented technique aims to reduce the distortion in the Mel filter-bank energy estimation due to the harmonic composition of voiced speech intervals and DFT (discrete Fourier transform) sampling when the central frequency of band-pass filters is shifted. This paper also proposes an analytical maximum likelihood (ML) method to estimate the optimal warping factor in the cepstral space. The presented interpolated filter-bank energy-based VTLN leads to relative reductions in WER (word error rate) as high as 11.2% and 7.6...
To reduce inter-speaker variability, vocal tract length normalization (VTLN) is commonly used to tra...
Vocal tract length normalisation (VTLN) is a commonly used speaker normalisation approach. It is att...
This paper presents speaker normalization approaches for the audio search task. Conventional state-of-th...
Vocal Tract Length Normalization (VTLN) for standard filterbank-based Mel Frequency Cepstral Coeffic...
We propose a new easy-to-implement method to compute a Linear Transform (LT) to perform Vocal Tract...
Vocal tract length normalization is an important feature normalization technique that can be used to...
The advent of statistical speech synthesis has enabled the unification of the basic techniques used ...
Generally speaking, the speaker-dependence of a speech recognition system stems from speaker-depende...
Vocal tract length normalization (VTLN) has been successfully used in automatic speech recognition f...
Inter-speaker variability, one of the problems faced in speech recognition systems, has caused the pe...
One of the major challenges for Automatic Speech Recognition is to handle speech variability. Inter-...
Vocal tract length normalisation (VTLN) is a well known rapid adaptation technique. VTLN as a linear...