We investigate speech recognition features related to voicing functions that indicate whether the vocal folds are vibrating. We describe two voicing features, periodicity and jitter, and demonstrate that they are powerful voicing discriminators. The periodicity and jitter features and their first and second time derivatives are appended to a standard 38-dimensional feature vector comprising the first and second time derivatives of the frame energy and the cepstral coefficients with their first and second time derivatives. HMM-based connected-digit (CD) and large-vocabulary (LV) recognition experiments comparing the traditional and extended feature sets show that voicing features and spectral information are complementary and that improved s...
Phenomena like filled pauses, laughter, breathing, hesitation, etc. play significant role in everyda...
The use of HMM (Hidden Markov Models) for speech recognition has been successful for various applica...
Abstract — As human listeners, it seems that we should be experts in processing vocal sounds. Here w...
We investigate a class of features related to voicing parameters that indicate whether the vocal cho...
Decomposition of speech signals into simultaneous streams of periodic and aperiodic information has ...
In this paper, we demonstrate two different methods for improving the accuracy and correctness of th...
In this paper the incorporation of important phonetic properties into hidden Markov models (HMM) is ...
very speech recognition system requires a signal representation that parametrically models the tempo...
We describe a speech recogniser which uses a speech production-motivated phonetic-feature descriptio...
Artificial neural networks (ANNs) have been used to classify phonetic features in speech. The featur...
Objective measurement of the severity of dysphonia typically requires signal processing algorithms a...
Decomposition of speech signals into simultaneous streams of periodic and aperiodic information has ...
This paper presents the theoretical basis and preliminary experimental results of a new HMM model, r...
HMMs are the dominating technique used in speech recognition today since they perform well in overal...
In one study on vocal emotion recognition using noise-vocoded speech (NVS), the high similarities be...
Phenomena like filled pauses, laughter, breathing, hesitation, etc. play significant role in everyda...
The use of HMM (Hidden Markov Models) for speech recognition has been successful for various applica...
Abstract — As human listeners, it seems that we should be experts in processing vocal sounds. Here w...
We investigate a class of features related to voicing parameters that indicate whether the vocal cho...
Decomposition of speech signals into simultaneous streams of periodic and aperiodic information has ...
In this paper, we demonstrate two different methods for improving the accuracy and correctness of th...
In this paper the incorporation of important phonetic properties into hidden Markov models (HMM) is ...
very speech recognition system requires a signal representation that parametrically models the tempo...
We describe a speech recogniser which uses a speech production-motivated phonetic-feature descriptio...
Artificial neural networks (ANNs) have been used to classify phonetic features in speech. The featur...
Objective measurement of the severity of dysphonia typically requires signal processing algorithms a...
Decomposition of speech signals into simultaneous streams of periodic and aperiodic information has ...
This paper presents the theoretical basis and preliminary experimental results of a new HMM model, r...
HMMs are the dominating technique used in speech recognition today since they perform well in overal...
In one study on vocal emotion recognition using noise-vocoded speech (NVS), the high similarities be...
Phenomena like filled pauses, laughter, breathing, hesitation, etc. play significant role in everyda...
The use of HMM (Hidden Markov Models) for speech recognition has been successful for various applica...
Abstract — As human listeners, it seems that we should be experts in processing vocal sounds. Here w...