Nowadays supervised sequence labeling models can reach competitive performance on the task of Chinese word segmenta-tion. However, the ability of these mod-els is restricted by the availability of an-notated data and the design of features. We propose a scalable semi-supervised fea-ture engineering approach. In contrast to previous works using pre-defined task-specific features with fixed values, we dy-namically extract representations of label distributions from both an in-domain cor-pus and an out-of-domain corpus. We update the representation values with a semi-supervised approach. Experiments on the benchmark datasets show that our approach achieve good results and reach an f-score of 0.961. The feature engineer-ing approach proposed he...
Currently most of state-of-the-art methods for Chinese word segmentation (CWS) are based on supervis...
This paper describes the system that we use for Chinese segmentation task in the 3rd CIPS-SIGHAN bak...
This study explores the feasibility of perform-ing Chinese word segmentation (CWS) and POS tagging b...
Nowadays supervised sequence labeling models can reach competitive performance on the task of Chines...
In this paper we report an empirical study on semi-supervised Chinese word segmenta-tion using co-tr...
There is rich knowledge encoded in on-line web data. For example, punctua-tion and entity tags in Wi...
This paper presents a bilingual semi-supervised Chinese word segmentation (CWS) method that leverage...
By exploiting unlabeled data for further performance improvement for Chinese word segmentation, this...
Almost all Chinese language processing tasks involve word segmentation of the language input as thei...
In this article, we focus on Chinese word segmentation by systematically incorporating non-local inf...
This paper presents a novel approach to Chinese word segmentation (CWS) that attempts to utilize glo...
This paper presents a novel approach to Chinese word segmentation (CWS) that attempts to utilize glo...
Although there has been significant previous work on semi-supervised learning for classification, th...
This paper introduces an approach which jointly performs a cascade of segmentation and labeling subt...
Current character-based approaches are not robust for cross domain Chinese word segmentation. In thi...
Currently most of state-of-the-art methods for Chinese word segmentation (CWS) are based on supervis...
This paper describes the system that we use for Chinese segmentation task in the 3rd CIPS-SIGHAN bak...
This study explores the feasibility of perform-ing Chinese word segmentation (CWS) and POS tagging b...
Nowadays supervised sequence labeling models can reach competitive performance on the task of Chines...
In this paper we report an empirical study on semi-supervised Chinese word segmenta-tion using co-tr...
There is rich knowledge encoded in on-line web data. For example, punctua-tion and entity tags in Wi...
This paper presents a bilingual semi-supervised Chinese word segmentation (CWS) method that leverage...
By exploiting unlabeled data for further performance improvement for Chinese word segmentation, this...
Almost all Chinese language processing tasks involve word segmentation of the language input as thei...
In this article, we focus on Chinese word segmentation by systematically incorporating non-local inf...
This paper presents a novel approach to Chinese word segmentation (CWS) that attempts to utilize glo...
This paper presents a novel approach to Chinese word segmentation (CWS) that attempts to utilize glo...
Although there has been significant previous work on semi-supervised learning for classification, th...
This paper introduces an approach which jointly performs a cascade of segmentation and labeling subt...
Current character-based approaches are not robust for cross domain Chinese word segmentation. In thi...
Currently most of state-of-the-art methods for Chinese word segmentation (CWS) are based on supervis...
This paper describes the system that we use for Chinese segmentation task in the 3rd CIPS-SIGHAN bak...
This study explores the feasibility of perform-ing Chinese word segmentation (CWS) and POS tagging b...