VTLN Using Different Warping Functions for Template Matching

Madhavi, Maulik C
Sharma, Shubham
Patil, Hemant A

Open link

Publication date

January 2016

DOI

10.1007/978-3-319-30315-4_10

Publisher

SPRINGER-VERLAG BERLIN

Abstract

In most automatic speech recognition (ASR) systems, speaker differences are compensated by normalizing the vocal tract lengths of the speakers. This is implemented by warping the frequency-axis by appropriate warping factor. However, it is computationally expensive to find warping factor for each speaker. This problem is overcome by incorporating a universal warping function for all the speakers. Different psychoacoustic scales have been proposed over the past decade that are assumed to be similar to the frequency response of basilarmembrane (BM) of human auditory system. In this paper, different warping functions are studied with an aim of vocal tract length normalization (VTLN) and template matching experiments are done using dynamic time...

Extracted data

We use cookies to provide a better user experience.

Data Protection

VTLN Using Different Warping Functions for Template Matching

Abstract

Extracted data

VTLN Using Different Warping Functions for Template Matching

Abstract

Extracted data

Related items

Related items