On Reducing Harmonic and Sampling Distortion in Vocal Tract Length Normalization

Becerra Yoma, Néstor
Garretón, Claudio
Huenupán, Fernando
Catalán, Ignacio
Wuth Sepúlveda, Jorge

Open link

Publication date

January 2013

DOI

10.1109/TASL.2012.2215590

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Abstract

Artículo de publicación ISIThis paper proposes a novel feature-space VTLN (vocal tract length normalization) method that models frequency warping as a linear interpolation of contiguous Mel filter-bank energies. The presented technique aims to reduce the distortion in the Mel filter-bank energy estimation due to the harmonic composition of voiced speech intervals and DFT (discrete Fourier transform) sampling when the central frequency of band-pass filters is shifted. This paper also proposes an analytical maximum likelihood (ML) method to estimate the optimal warping factor in the cepstral space. The presented interpolated filter-bank energy- based VTLN leads to relative reductions inWER (word error rate) as high as 11.2% and 7.6...

Extracted data

We use cookies to provide a better user experience.

Data Protection

On Reducing Harmonic and Sampling Distortion in Vocal Tract Length Normalization

Abstract

Extracted data

On Reducing Harmonic and Sampling Distortion in Vocal Tract Length Normalization

Abstract

Extracted data

Related items

Related items