This article presents a pragmatic approach to Chinese word segmentation. It differs from most previous approaches mainly in three respects. First, while theoretical linguists have defined Chinese words using various linguistic criteria, Chinese words in this study are defined pragmatically as segmentation units whose definition depends on how they are used and processed in realistic computer applications. Second, we propose a pragmatic mathematical framework in which segmenting known words and detecting unknown words of different types (i.e., morphologically derived words, factoids, named entities, and other unlisted words) can be performed simultaneously in a unified way. These tasks are usually conducted separately in other systems. Fin...
As the amount of online Chinese content grows, there is a critical need for effective Chinese word ...
Chinese word segmentation is the first step for Chinese text processing. The accuracy of Chinese wor...
A Chinese sentence is typically written as a sequence of characters. However, a word, a logical sema...
This paper presents a Chinese word segmentation system that uses improved source-channel models of C...
Abstract. This paper proposes an approach to Chinese word segmentation based on statistics and rules. The appro...
With the development of Chinese word segmentation, in-vocabulary word segmentation and named entity rec...
This paper addresses two remaining challenges in Chinese word segmentation. The challenge in HLT is ...
(Alias-i 2006) to Chinese word segmentation and named entity recognition. We provide results for th...
The fact that words are not conventionally demarcated in Chinese orthography makes the process of wo...
The Chinese language, unlike English, is written without marked word boundaries, and Chinese word se...
This paper describes a novel method for domain-adaptive Chinese word segmentation. Unlike traditio...
Abstract. This paper presents an empirical comparison between the word-based method and the character-based ...
Word segmentation is the first step in Chinese information processing, and the performance of the se...
Among statistical approaches to Chinese word segmentation, the word-based n-gram (generative) model ...