This paper presents a decoding method for automatic speech recognition (ASR) that reduces the impact of local spectral and temporal variabilities on ASR performance. The procedure involves augmenting the standard Viterbi search for an optimum state sequence with a locally constrained search for optimum degrees of spectral warping or temporal warping applied to individual analysis frames. It is argued in the paper that this represents an efficient and effective method for compensating for local variability in speech which may have potential application to a broader array of speech transformations. The techniques are presented in the context of existing methods for frequency warping based speaker normalization and existing methods for computa...
A novel frame-wise model adaptation approach for reverberation-robust distant-talking speech recogni...
The thesis deals with different aspects of automatic speech recognition. After an introduction, whic...
The paper describes a system for automatic speech recognition (ASR) that is benchmarked with data of...
This paper presents a new modeling framework which naturally extends the Hidden Markov Model (HMM) a...
This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) p...
This paper presents a novel acoustic modeling framework that naturally extends the Hidden Markov Mod...
The majority of automatic speech recognition (ASR) systems rely on hidden Markov models (HMM), in wh...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
Funding Information: This work was supported by the Academy of Finland (grants 329267, 330139). Publ...
This work investigates the application of spectral and temporal speech processing algorithms develop...
This paper investigates the impact of subspace based techniques for acoustic modeling in automatic s...
This thesis examines techniques to improve the robustness of automatic speech recogni-tion (ASR) sys...
This paper is concerned with increasing the robustness of automatic speech recognition systems (ASR)...
The performance of automatic speech recognition (ASR) is known to degrade under noise corruption. Su...
Automatic speech recognition (ASR) decodes speech signals into text. While ASR can produce accurate ...
A novel frame-wise model adaptation approach for reverberation-robust distant-talking speech recogni...
The thesis deals with different aspects of automatic speech recognition. After an introduction, whic...
The paper describes a system for automatic speech recognition (ASR) that is benchmarked with data of...
This paper presents a new modeling framework which naturally extends the Hidden Markov Model (HMM) a...
This thesis addresses the general problem of maintaining robust automatic speech recognition (ASR) p...
This paper presents a novel acoustic modeling framework that naturally extends the Hidden Markov Mod...
The majority of automatic speech recognition (ASR) systems rely on hidden Markov models (HMM), in wh...
This thesis examines techniques to improve the robustness of automatic speech recognition (ASR) syst...
Funding Information: This work was supported by the Academy of Finland (grants 329267, 330139). Publ...
This work investigates the application of spectral and temporal speech processing algorithms develop...
This paper investigates the impact of subspace based techniques for acoustic modeling in automatic s...
This thesis examines techniques to improve the robustness of automatic speech recogni-tion (ASR) sys...
This paper is concerned with increasing the robustness of automatic speech recognition systems (ASR)...
The performance of automatic speech recognition (ASR) is known to degrade under noise corruption. Su...
Automatic speech recognition (ASR) decodes speech signals into text. While ASR can produce accurate ...
A novel frame-wise model adaptation approach for reverberation-robust distant-talking speech recogni...
The thesis deals with different aspects of automatic speech recognition. After an introduction, whic...
The paper describes a system for automatic speech recognition (ASR) that is benchmarked with data of...