Because a single Chinese syllable can correspond to many characters (homophones), syllable-to-character conversion is a challenging task for Chinese phonetic input methods (CPIMs). A CPIM usually works in two stages: (1) it segments the syllable sequence into syllable words; (2) it selects the most likely character word for each syllable word. A CPIM usually assumes that the input is a complete sentence, and its performance is evaluated on a well-formed corpus. In practice, however, most Pinyin users prefer to enter text progressively in several short chunks, mostly of one or two words each (most Chinese words consist of two or more characters). Short chunks do not provide enough context to perform the best possible syllable-to-character conversion...
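The two stages described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy `LEXICON`, its made-up counts, the fewest-words segmentation heuristic, and the most-frequent-candidate selection rule. It is not the method of any particular CPIM.

```python
from functools import lru_cache

# Hypothetical toy lexicon: syllable word -> candidate character words with
# made-up frequency counts (illustrative only, not real corpus statistics).
LEXICON = {
    ("ni",): {"你": 50, "泥": 5},
    ("hao",): {"好": 60, "号": 20},
    ("ni", "hao"): {"你好": 80},
    ("shi",): {"是": 90, "事": 30, "十": 25},
    ("jie",): {"街": 10, "接": 15},
    ("shi", "jie"): {"世界": 70, "视界": 3},
}


def segment(syllables):
    """Stage 1: split the syllable sequence into syllable words.

    Assumed heuristic: prefer the segmentation with the fewest words.
    Returns a tuple of syllable words, or None if no segmentation exists.
    """
    syllables = tuple(syllables)

    @lru_cache(maxsize=None)
    def best(i):
        if i == len(syllables):
            return ()
        options = []
        for j in range(i + 1, len(syllables) + 1):
            word = syllables[i:j]
            if word in LEXICON:
                rest = best(j)
                if rest is not None:
                    options.append((word,) + rest)
        return min(options, key=len) if options else None

    return best(0)


def convert(syllables):
    """Stage 2: pick the most frequent character word for each syllable word."""
    words = segment(syllables)
    if words is None:
        return None
    return "".join(max(LEXICON[w], key=LEXICON[w].get) for w in words)


if __name__ == "__main__":
    print(convert(["ni", "hao", "shi", "jie"]))
    print(convert(["shi"]))
```

In this sketch, a short chunk such as the single syllable `shi` can only fall back on the most frequent homophone, because there is no surrounding context to disambiguate it. This is the difficulty with short chunks noted above.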
Chinese Syllable-to-Character (S2C) conversion is an important component of input methods, and the...
Syllable-to-word (STW) conversion is a frequently used Chinese input method that is fundamental to s...
In real speech, prosodic words (PWs), unlike lexical words (LWs), are the basic rhythmic units. The nat...
Many practical speech recognition applications such as in information retrieval need to be portable ...
The Chinese language is written without using spaces or other word delimiters. Although a text may b...
The fact that words are not conventionally demarcated in Chinese orthography makes the process of wo...
A Chinese sentence is typically written as a sequence of characters. However, a word, a logical sema...
In 3 experiments, we tested 3 possible mechanisms for segmenting overlapping ambiguous strings in Ch...
This paper addresses two remaining challenges in Chinese word segmentation. The challenge in HLT is ...
Large alphabet languages such as Chinese present different problems for language modelling compared ...
In this paper, we propose a joint model for unsupervised Chinese word segmentation (CWS). Inspired b...