[[abstract]]For languages with characters,it is critical to design the effective retrieval system between their phonetic transcriptions and corresponding characters.In Taiwan society,Mandarin,Taiwanese and Hakka are three major and the most popular dialects of Hart languages spoken. Their writing systems are all in Hart characters and also with their respective Romanized phonetic transcription.In this paper,we propose a minimal perfect hashing function to 4135 unified Mandarin,Taiwanese and Hakka existing Romanized phonetic transcriptions to their corresponding Hart characters.Compared to the hashing designs based on the Chinese remainder theorem for various data sets,the proposed design is shown to be superior in space utilization. This no...
This article provides background information on the development of three line-ages of character sets...
One of the popular input systems is based on Chinese phonetic symbols. Designing such kind of a syll...
[[abstract]]Corpora, in their different forms for different purposes, have been the bases for modern...
In order to preserve distinctive cultures, people anxiously figure out writing systems of their lang...
[[abstract]]In Taiwan, Taiwanese is used about 85% of the population, so It is the most populous lan...
ABSTRACT In this paper, 1we survey the Chinese polyphonic characters and introduce our solution to c...
Chinese input method is one of the most difficult problems in Chinese Language Processing. And to in...
Unicode 6.1 (2012) had encoded more than 74,000 Han characters. This great repertory could solve the...
The Chinese and Japanese languages share Chinese characters. Since the Chinese characters in Japanes...
International audienceChinese characters have a complex and hierarchical graphical structure carryin...
Chinese input is one of the key challenges for Chinese PC users. This paper proposes a statistical a...
The usefulness of accurate sequence information is re-evaluated in this paper. A novel idea, called ...
The complexity of Chinese orthography has hindered the progress of research in Chinese to the same l...
Most studies on Mandarin HTS (HMM-based text-to-speech system) have taken the initial/final as the ...
Due to the possibilities of computer typesetting new forms of word representation with various scrip...
This article provides background information on the development of three line-ages of character sets...
One of the popular input systems is based on Chinese phonetic symbols. Designing such kind of a syll...
[[abstract]]Corpora, in their different forms for different purposes, have been the bases for modern...
In order to preserve distinctive cultures, people anxiously figure out writing systems of their lang...
[[abstract]]In Taiwan, Taiwanese is used about 85% of the population, so It is the most populous lan...
ABSTRACT In this paper, 1we survey the Chinese polyphonic characters and introduce our solution to c...
Chinese input method is one of the most difficult problems in Chinese Language Processing. And to in...
Unicode 6.1 (2012) had encoded more than 74,000 Han characters. This great repertory could solve the...
The Chinese and Japanese languages share Chinese characters. Since the Chinese characters in Japanes...
International audienceChinese characters have a complex and hierarchical graphical structure carryin...
Chinese input is one of the key challenges for Chinese PC users. This paper proposes a statistical a...
The usefulness of accurate sequence information is re-evaluated in this paper. A novel idea, called ...
The complexity of Chinese orthography has hindered the progress of research in Chinese to the same l...
Most studies on Mandarin HTS (HMM-based text-to-speech system) have taken the initial/final as the ...
Due to the possibilities of computer typesetting new forms of word representation with various scrip...
This article provides background information on the development of three line-ages of character sets...
One of the popular input systems is based on Chinese phonetic symbols. Designing such kind of a syll...
[[abstract]]Corpora, in their different forms for different purposes, have been the bases for modern...