In Mandarin speech recognition, initial-final subword units are commonly used. According to the Frequency Dictionary of Modern Chinese[4], among the 9000 most frequent words, 26.7% are unigrams, 69.8% are bigrams, 2.7% are trigrams, 0.0007% are 4-grams, and 0.0002% are 5-grams. Another study[19] showed that, in general, 75% of Chinese words are bigrams, 14% are trigrams, and 6% are n-grams with n > 3. Each character is monosyllabic. If initial-final segmentation is used, each character contributes an initial and a final, so a word of one to three characters consists of only two to six units. This is relatively short compared with English words, which contain about seven phonemes on average. For this reason, utterance verification of Chinese keywords performs worse than that of English keywords, particularly for ...
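The arithmetic behind the "two to six units" observation can be sketched as follows. This is only an illustration under stated assumptions: the shares are the dictionary percentages quoted above (they do not sum to exactly 100, so the sketch renormalizes them), every character is assumed to map to exactly two units (one initial and one final), and the names `length_share`, `UNITS_PER_SYLLABLE`, and `mean_units_per_word` are hypothetical and do not come from the original work.

```python
# Word-length distribution (in characters) for the 9000 most frequent words,
# as quoted in the text. Values are percentages and do not sum to exactly 100.
length_share = {1: 26.7, 2: 69.8, 3: 2.7, 4: 0.0007, 5: 0.0002}

# Assumption: each monosyllabic character yields one initial and one final.
UNITS_PER_SYLLABLE = 2


def mean_units_per_word(shares, units_per_syllable=UNITS_PER_SYLLABLE):
    """Expected number of subword units per word, after renormalizing shares."""
    total = sum(shares.values())
    mean_chars = sum(n * p for n, p in shares.items()) / total
    return mean_chars * units_per_syllable


if __name__ == "__main__":
    print(f"mean units per word: {mean_units_per_word(length_share):.2f}")
```

Under these assumptions the average comes out well below the roughly seven phonemes per English word cited in the text, which is the length gap the passage points to.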
The nonlinear dynamic characteristics of expansion and contraction and the sequential time-varying f...
High error rate in speech recognition is largely due to effects of phone local mismatch caused by un...
The Chinese language is based on characters which are syllabic in nature. Since languages have sylla...
In this paper, an effective approach for Chinese speech recognition on small vocabulary size is prop...
The work in this paper concerns the determination of a recognition unit in a small footprint Chinese...
This thesis investigated the use of various kinds of confidence measures for Mandarin la...
Speaker segmentation is widely used in many tasks such as multi-speaker detection and speaker tracki...
Many practical speech recognition applications such as in information retrieval need to be portable ...
The length of the test speech greatly influences the performance of GMM-UBM based text-inde...
This paper reports a detailed study on Minimum Phone Error (MPE), Minimum Phone Frame Error (MPFE),...
In this paper, several special speech recognition approaches based on hidden Markov models (HMMs) ar...
Pronunciation variability is an important issue that must be faced when developing practica...
This paper presents an empirical study of word error minimization approaches for Mandarin large voca...
the role of complete speech units like syllables in carrying speaker information needs further inves...
This paper presents an empirical study of word error minimization approaches for Mandarin large voca...