Abstract. Since the traditional word-based n-gram model, a generative approach, cannot handle those out-of-vocabulary (OOV) words in the testing-set, the character-based discriminative approach has been widely adopted recently. However, this discriminative model, though is more robust to OOV words, fails to deliver satisfactory performance for those in-vocabulary (IV) words that have been observed before. Having analyzed the word-based approach, its capability to handle the dependency between adjacent characters within a word, which is believed that the human adopts for doing segmentation, is found to account for its excellent performance for those IV words. To incorporate the intra-word characters dependency, a character-based approach wit...
We investigate whether suffix related features can significantly improve the performance of characte...
tion (CWS) Bakeoff on 2003, CWS has experienced a prominent flourish be-cause Bakeoff provides a pla...
The fact that words are not conventionally demarcated in Chinese orthography makes the process of wo...
Among statistical approaches to Chinese word segmentation, the word-based n-gram (generative) model ...
Abstract. This paper proposes an empirical comparison between word-based method and character-based ...
Current character-based approaches are not robust for cross domain Chinese word segmentation. In thi...
This paper presents our system for the CIPS-SIGHAN-2014 bakeoff task of Chinese word segmentation. T...
This paper addresses two remaining challenges in Chinese word segmentation. The challenge in HLT is ...
In this paper, we propose a joint model for unsupervised Chinese word segmentation (CWS). Inspired b...
In this paper, we propose a joint model for unsupervised Chinese word segmentation (CWS). Inspired b...
In this paper, we propose a joint model for unsupervised Chinese word segmentation (CWS). Inspired b...
This article presents a pragmatic approach to Chinese word segmentation. It differs from most previo...
This paper describes a hybrid Chinese word segmenter that is being developed as part of a larger Chi...
This paper describes the Chinese Word Segmenter for the fourth International Chinese Language Proces...
Synthetic word analysis is a potentially important but relatively unexplored problem in Chinese natu...
We investigate whether suffix related features can significantly improve the performance of characte...
tion (CWS) Bakeoff on 2003, CWS has experienced a prominent flourish be-cause Bakeoff provides a pla...
The fact that words are not conventionally demarcated in Chinese orthography makes the process of wo...
Among statistical approaches to Chinese word segmentation, the word-based n-gram (generative) model ...
Abstract. This paper proposes an empirical comparison between word-based method and character-based ...
Current character-based approaches are not robust for cross domain Chinese word segmentation. In thi...
This paper presents our system for the CIPS-SIGHAN-2014 bakeoff task of Chinese word segmentation. T...
This paper addresses two remaining challenges in Chinese word segmentation. The challenge in HLT is ...
In this paper, we propose a joint model for unsupervised Chinese word segmentation (CWS). Inspired b...
In this paper, we propose a joint model for unsupervised Chinese word segmentation (CWS). Inspired b...
In this paper, we propose a joint model for unsupervised Chinese word segmentation (CWS). Inspired b...
This article presents a pragmatic approach to Chinese word segmentation. It differs from most previo...
This paper describes a hybrid Chinese word segmenter that is being developed as part of a larger Chi...
This paper describes the Chinese Word Segmenter for the fourth International Chinese Language Proces...
Synthetic word analysis is a potentially important but relatively unexplored problem in Chinese natu...
We investigate whether suffix related features can significantly improve the performance of characte...
tion (CWS) Bakeoff on 2003, CWS has experienced a prominent flourish be-cause Bakeoff provides a pla...
The fact that words are not conventionally demarcated in Chinese orthography makes the process of wo...