The paper introduces a rough set technique for solving the problem of mining Pinyin-to-character (PTC) conversion rules. It first presents a text-structuring method by constructing a language information table from a corpus for each pinyin, which it will then apply to a free-form textual corpus. Data generalization and rule extraction algorithms can then be used to eliminate redundant information and extract consistent PTC conversion rules. The design of our model also addresses a number of important issues such as the long-distance dependency problem, the storage requirements of the rule base, and the consistency of the extracted rules, while the performance of the extracted rules as well as the effects of different model parameters are ev...
This paper is concerned with building linguistic resources and statistical parsers for deep grammati...
We have recently developed a Chinese phoneme-to-character conversion system with a conversion rate c...
Transcription of Chinese syllables such as Pinyin to the corresponding Chinese character (Hanzi) is ...
Abstract—This paper introduces a rough set technique for solving the problem of mining Pinyin-to-cha...
Statistical method is a good way for pinyin to Chinese characters conversion and has gotten preferab...
Grapheme-to-phoneme (G2P) conversion is a very important component in a Text-to-Speech (TTS) system....
This report explores the possibility of creating a Pinyin Character conversion system which is more ...
The Pinyin-to-Character Conversion task is the core process of the Chinese pinyin-based input method...
We address the problem of statistical language modeling in the context of PinYin to Chinese (PTC) ...
In this article, we propose a new postprocessing strategy, word suggestion, based on a multiple word...
We propose a new goal for constructing a Chinese phoneme-to-character automatic conversion system. I...
Chinese input is one of the key challenges for Chinese PC users. This paper proposes a statistical a...
An N2gram Chinese language model incorporating linguistic rules is presented. By constructing elemen...
International audienceChinese characters have a complex and hierarchical graphical structure carryin...
This paper reveals some important properties of CFSs and applications in Chinese natural language pr...
This paper is concerned with building linguistic resources and statistical parsers for deep grammati...
We have recently developed a Chinese phoneme-to-character conversion system with a conversion rate c...
Transcription of Chinese syllables such as Pinyin to the corresponding Chinese character (Hanzi) is ...
Abstract—This paper introduces a rough set technique for solving the problem of mining Pinyin-to-cha...
Statistical method is a good way for pinyin to Chinese characters conversion and has gotten preferab...
Grapheme-to-phoneme (G2P) conversion is a very important component in a Text-to-Speech (TTS) system....
This report explores the possibility of creating a Pinyin Character conversion system which is more ...
The Pinyin-to-Character Conversion task is the core process of the Chinese pinyin-based input method...
We address the problem of statistical language modeling in the context of PinYin to Chinese (PTC) ...
In this article, we propose a new postprocessing strategy, word suggestion, based on a multiple word...
We propose a new goal for constructing a Chinese phoneme-to-character automatic conversion system. I...
Chinese input is one of the key challenges for Chinese PC users. This paper proposes a statistical a...
An N2gram Chinese language model incorporating linguistic rules is presented. By constructing elemen...
International audienceChinese characters have a complex and hierarchical graphical structure carryin...
This paper reveals some important properties of CFSs and applications in Chinese natural language pr...
This paper is concerned with building linguistic resources and statistical parsers for deep grammati...
We have recently developed a Chinese phoneme-to-character conversion system with a conversion rate c...
Transcription of Chinese syllables such as Pinyin to the corresponding Chinese character (Hanzi) is ...