The overall success of automatic speech recognition (ASR) depends on efficient phoneme recognition and on the quality of the speech signal the recognizer receives. However, differences between speakers' inputs affect overall recognition performance, and one of the main problems is inter-speaker variability. Vocal tract length normalization (VTLN) was introduced to compensate for inter-speaker variation in the speech signal by applying a speaker-specific warping of the frequency scale of a filter bank. Instead of measuring performance at the word level with speaker-specific warping, this research tackles the problem directly at the phoneme level and applies VTLN to all speakers' speech signals to analyse the best setting for t...
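The frequency-scale warping described above can be illustrated with a minimal sketch. It assumes a piecewise-linear warping function, which is one common choice for VTLN; the abstract does not say which warping function the authors used. The function name `vtln_warp`, the cutoff ratio of 0.85, the 8 kHz Nyquist frequency, and the warp-factor grid are all illustrative assumptions, not values taken from the paper.

```python
import numpy as np

def vtln_warp(freqs, alpha, f_nyquist=8000.0, f_cut_ratio=0.85):
    """Piecewise-linear VTLN warp of frequencies in Hz (hypothetical helper).

    Below a cutoff the frequency axis is scaled by alpha; above it, a second
    linear segment maps the Nyquist frequency onto itself so the warped axis
    still covers the full band. All defaults are assumptions.
    """
    freqs = np.asarray(freqs, dtype=float)
    # Shrink the cutoff for alpha > 1 so the scaled segment stays below Nyquist.
    f_cut = f_cut_ratio * f_nyquist * min(1.0, 1.0 / alpha)
    low = alpha * freqs
    # Upper segment joins (f_cut, alpha * f_cut) to (f_nyquist, f_nyquist).
    slope = (f_nyquist - alpha * f_cut) / (f_nyquist - f_cut)
    high = alpha * f_cut + slope * (freqs - f_cut)
    return np.where(freqs <= f_cut, low, high)

# Warp the centre frequencies of a toy filter bank for a grid of warp factors.
centers = np.linspace(100.0, 7900.0, 24)   # illustrative centre frequencies
alphas = np.linspace(0.88, 1.12, 13)       # assumed search grid
warped = {round(float(a), 2): vtln_warp(centers, a) for a in alphas}
```

With `alpha = 1.0` the warp is the identity, so the unwarped filter bank is one point of the grid. In a VTLN system the warp factor would then be chosen per speaker (or, as in the study above, compared across all speakers) by some selection criterion, which this sketch leaves out.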
Augmenting datasets by transforming inputs in a way that does not change the label is a crucial ingr...
This paper presents speaker normalization approaches for audio search task. Conventional state-of-th...
An efficient algorithm for speaker-independent spoken word recognition is presented. This algorithm ...
Abstract. Inter-speaker variability, one of the problems faced in speech recognition systems, has cau...
Inter-speaker variability, one of the problems faced in speech recognition systems, has caused the pe...
In speech recognition, speaker-dependence of a speech recognition system comes from speaker-dependen...
Generally speaking, the speaker-dependence of a speech recognition system stems from speaker-depende...
In this paper, we consider the generation of features for automatic speech recognition (ASR) that ar...
In most automatic speech recognition (ASR) systems, speaker differences are compensated by normalizi...
One of the main problems faced by automatic speech recognition is the variability of the testing con...
A proven method for achieving effective automatic speech recognition (ASR) due to speaker difference...
One of the main problems faced by automatic speech recognition is the variability of the testing con...
One of the major challenges for Automatic Speech Recognition is to handle speech variability. Inter-...
This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) p...
ISI publication article. This paper proposes a novel feature-space VTLN (vocal tract length norma...