In this paper, we propose a Chinese word segmentation model for micro-blog text. Although Conditional Random Fields (CRFs) models have been applied to word segmentation, this is the first time they have been applied to segmentation in the domain of Chinese micro-blogs. Unlike common article genres, the micro-blog has gradually become a new literary form with the development of the Internet. However, the unavailability of micro-blog training data has been an obstacle to developing a good segmenter based on trainable models. Considering the linguistic characteristics of the text, we propose some methods to make CRFs models suitable for segmentation in the micro-blog domain. Several experiments have been conducted with...
This paper presents a Chinese word segmentation system submitted to the closed training evaluations ...
Background: Chinese word segmentation (CWS) and part-of-speech (POS) tagging are two fundame...
In this paper, we describe a Chinese word segmentation system that we developed for the Third SIGHA...
In this evaluation, we took part in the Word Segmentation on Chinese MicroBlog task. In...
This paper proposes a Hidden Markov Model (HMM) based tokenizer for Chinese micro-blog texts. Compa...
The state-of-the-art Chinese word segmentation systems have achieved high performance on well-formed...
This thesis proposes an approach to generating n-gram features for Conditional Random Fields (CRFs) ...
Chinese word segmentation is a difficult, important and widely-studied sequence modeling problem. T...
Chinese word segmentation is a difficult, important and widely-studied sequence modeling problem. Th...
This paper presents our system for the CIPS-SIGHAN-2014 bakeoff task of Chinese word segmentation. T...
Chinese word segmentation is a difficult, important and widely-studied sequence modeling problem. Th...
This paper presents a novel approach to Chinese word segmentation (CWS) that attempts to utilize glo...
This paper presents a novel approach to Chinese word segmentation (CWS) that attempts to utilize glo...
This paper describes the Chinese Word Segmenter for the fourth International Chinese Language Proces...
Almost all Chinese language processing tasks involve word segmentation of the language input as thei...