The nonlinear dynamic characteristics of expansion and contraction and the sequential time-varying features of the syllable pronunciations greatly complicate the tasks of automatic speech recognition. Each syllable is represented by a sequence of vectors of linear predict coding cepstra (LPCC). Even if the same speaker utters the same syllable, the duration of stable parts of the sequence of LPCC vectors changes every time. Therefore, the duration of stable parts is contracted such that the compressed speech waveform has the same length. We propose five different simple techniques to contract the stable parts of the sequence of LPCC vectors. A simplified Bayes decision rule with a weighted variance is used to classify 408 speaker-dependent ...
In this paper, a new approach of using temporal information to assist in Mandarin speech recognition...
Abstract. Tone plays an important lexical role in spoken tonal languages like Mandarin Chinese. In t...
The article presents a robust representation of speech based on AR modeling of the causal part of th...
In this paper, several special speech recognition approaches based on hidden Markov models (HMMs) ar...
[[abstract]]A speech recognition system for all the Chinese syllables is described. The system is a ...
This paper presents a new framework for improved large vocabulary Mandarin speech recognition using ...
This paper presents a new framework for improved large vocabulary Mandarin speech recognition using ...
Learning a new language may be difficult for adults, especially the communication aspect of the proc...
Learning a new language may be difficult for adults, especially the communication aspect of the proc...
Learning a new language may be difficult for adults, especially the communication aspect of the proc...
The pronunciation variability is an important issue that must be faced with when developing practica...
In speaker-independent speech recognition, the disadvantage of the most diffused technology (HMMs, o...
[[abstract]]This thesis investigated the use of various kinds of confidence measures for Mandarin la...
In speaker-independent speech recognition, the disadvantage of the most diffused technology ( Hidden...
In this paper, a new approach of using temporal information to assist in Mandarin speech recognition...
In this paper, a new approach of using temporal information to assist in Mandarin speech recognition...
Abstract. Tone plays an important lexical role in spoken tonal languages like Mandarin Chinese. In t...
The article presents a robust representation of speech based on AR modeling of the causal part of th...
In this paper, several special speech recognition approaches based on hidden Markov models (HMMs) ar...
[[abstract]]A speech recognition system for all the Chinese syllables is described. The system is a ...
This paper presents a new framework for improved large vocabulary Mandarin speech recognition using ...
This paper presents a new framework for improved large vocabulary Mandarin speech recognition using ...
Learning a new language may be difficult for adults, especially the communication aspect of the proc...
Learning a new language may be difficult for adults, especially the communication aspect of the proc...
Learning a new language may be difficult for adults, especially the communication aspect of the proc...
The pronunciation variability is an important issue that must be faced with when developing practica...
In speaker-independent speech recognition, the disadvantage of the most diffused technology (HMMs, o...
[[abstract]]This thesis investigated the use of various kinds of confidence measures for Mandarin la...
In speaker-independent speech recognition, the disadvantage of the most diffused technology ( Hidden...
In this paper, a new approach of using temporal information to assist in Mandarin speech recognition...
In this paper, a new approach of using temporal information to assist in Mandarin speech recognition...
Abstract. Tone plays an important lexical role in spoken tonal languages like Mandarin Chinese. In t...
The article presents a robust representation of speech based on AR modeling of the causal part of th...