[[abstract]]This study investigates language modeling for Mandarin continuous speech recognition. First, a word topical mixture model (WTMM) was proposed to explore the co-occurrence relationship between words, as well as the long-span latent topical information, for language model adaptation. During Speech recognition, the search history is modeled as a composite WTMM model for predicting a newly decoded word. Second, a position-dependent language model was presented to make use of the word positional information within documents and sentences for better estimation of word occurrences. The word positional information was exploited in conjunction with that information provided by the conventional N-gram and probabilistic latent semantic ana...
In this paper, a new approach of using temporal information to assist in Mandarin speech recognition...
This thesis presents a colloquial language modeling technique for spontaneous Cantonese speech recog...
[[abstract]]Huge quantities of multimedia contents including audio and video are continuously growin...
[[abstract]]Statistical language modeling, which aims to capture the regularities in human natural l...
This paper considers dynamic language model adaptation for Mandarin broadcast news recognition. Both...
[[abstract]]This thesis investigated the use of various kinds of confidence measures for Mandarin la...
To improve the Mandarin large vocabulary continuous speech recognition (LVCSR), a unified framework ...
This article investigates the use of several lightly supervised and data-driven approaches to Mandar...
This paper presents an empirical study of word error minimization approaches for Mandarin large voca...
This paper considers extractive summarization of Chinese spoken documents. In contrast to convention...
[[abstract]]N-gram language modeling is a crucial component in any speech recognizer since it is exp...
In this paper, a basic Mandarin broadcast news speech recognition system is constructed using the MA...
This paper presents a new framework for improved large vocabulary Mandarin speech recognition using ...
We present a data-driven framework for expanding the lexicon to improve Mandarin broadcast news and ...
We explore the integration of multiple factors such as genre and speaker gender for acoustic model a...
In this paper, a new approach of using temporal information to assist in Mandarin speech recognition...
This thesis presents a colloquial language modeling technique for spontaneous Cantonese speech recog...
[[abstract]]Huge quantities of multimedia contents including audio and video are continuously growin...
[[abstract]]Statistical language modeling, which aims to capture the regularities in human natural l...
This paper considers dynamic language model adaptation for Mandarin broadcast news recognition. Both...
[[abstract]]This thesis investigated the use of various kinds of confidence measures for Mandarin la...
To improve the Mandarin large vocabulary continuous speech recognition (LVCSR), a unified framework ...
This article investigates the use of several lightly supervised and data-driven approaches to Mandar...
This paper presents an empirical study of word error minimization approaches for Mandarin large voca...
This paper considers extractive summarization of Chinese spoken documents. In contrast to convention...
[[abstract]]N-gram language modeling is a crucial component in any speech recognizer since it is exp...
In this paper, a basic Mandarin broadcast news speech recognition system is constructed using the MA...
This paper presents a new framework for improved large vocabulary Mandarin speech recognition using ...
We present a data-driven framework for expanding the lexicon to improve Mandarin broadcast news and ...
We explore the integration of multiple factors such as genre and speaker gender for acoustic model a...
In this paper, a new approach of using temporal information to assist in Mandarin speech recognition...
This thesis presents a colloquial language modeling technique for spontaneous Cantonese speech recog...
[[abstract]]Huge quantities of multimedia contents including audio and video are continuously growin...