An N2gram Chinese language model incorporating linguistic rules is presented. By constructing elements lattice, rules information is incorporated in statistical frame. To facilitate the hybrid modeling, novel methods such as MI2 based rule evaluating, weighted rule quantification and element2based n2gram probability approximation are present2 ed. Dynamic Viterbi algorithm is adopted to search the best path in lattice. To strengthen the model, transforma2 tion2based error2driven rules learning is adopted. Applying proposed model to Chinese Pinyin2to2character conver2 sion, high performance has been achieved in accuracy, flexibility and robustness simultaneously. Tests show correct rate achieves 94. 81 % instead of 90. 53 % using bi2gram Mark...
Two of the most popular Machine Translation (MT) paradigms are rule based (RBMT) and corpus based, w...
This article presents some experimental results on Chinese to Spanish machine translation. The imple...
As the growth of exchange activities between four regions of cross strait, the problem to correctly ...
Grapheme-to-phoneme (G2P) conversion is a very important component in a Text-to-Speech (TTS) system....
Chinese input is one of the key challenges for Chinese PC users. This paper proposes a statistical a...
A combination of statistical and rule methods has been developed for Chinese spoken language analyzi...
We address the problem of statistical language modeling in the context of PinYin to Chinese (PTC) ...
Statistical method is a good way for pinyin to Chinese characters conversion and has gotten preferab...
We have developed a two-stage machine translation (MT) system. The first stage is a rule-based machi...
The paper introduces a rough set technique for solving the problem of mining Pinyin-to-character (PT...
In this paper, we consider the problem of sparse data in probabilistic modeling of the Chinese langu...
The Pinyin-to-Character Conversion task is the core process of the Chinese pinyin-based input method...
Transcription of Chinese syllables such as Pinyin to the corresponding Chinese character (Hanzi) is ...
This paper proposes a novel method integrating multi-level linguistic knowledge for Chinese grapheme...
Statistical language models (SLM) encode linguistic information in the form of estimation of probabi...
Two of the most popular Machine Translation (MT) paradigms are rule based (RBMT) and corpus based, w...
This article presents some experimental results on Chinese to Spanish machine translation. The imple...
As the growth of exchange activities between four regions of cross strait, the problem to correctly ...
Grapheme-to-phoneme (G2P) conversion is a very important component in a Text-to-Speech (TTS) system....
Chinese input is one of the key challenges for Chinese PC users. This paper proposes a statistical a...
A combination of statistical and rule methods has been developed for Chinese spoken language analyzi...
We address the problem of statistical language modeling in the context of PinYin to Chinese (PTC) ...
Statistical method is a good way for pinyin to Chinese characters conversion and has gotten preferab...
We have developed a two-stage machine translation (MT) system. The first stage is a rule-based machi...
The paper introduces a rough set technique for solving the problem of mining Pinyin-to-character (PT...
In this paper, we consider the problem of sparse data in probabilistic modeling of the Chinese langu...
The Pinyin-to-Character Conversion task is the core process of the Chinese pinyin-based input method...
Transcription of Chinese syllables such as Pinyin to the corresponding Chinese character (Hanzi) is ...
This paper proposes a novel method integrating multi-level linguistic knowledge for Chinese grapheme...
Statistical language models (SLM) encode linguistic information in the form of estimation of probabi...
Two of the most popular Machine Translation (MT) paradigms are rule based (RBMT) and corpus based, w...
This article presents some experimental results on Chinese to Spanish machine translation. The imple...
As the growth of exchange activities between four regions of cross strait, the problem to correctly ...