For languages without word boundary delimiters, dictionaries are needed for segmenting running texts. This figure makes segmentation accuracy depend significantly on the quality of the dictionary used for analysis. If the dictionary is not sufficiently good, it will lead to a great number of unknown or unrecognized words. These unrecognized words certainly reduce segmentation accuracy. To solve such problem, we propose a method based on decision tree models. Without use of a dictionary, specific information, called syntactic attribute, is applied to identify the structure of Thai words. C4.5 is used as a tool for this purpose. Using a Thai corpus, experiment results show that our method outperforms some well-known dictionary-dependent techn...
This paper presents an application of two machine learning algorithms, i.e., Winnow and RIPPER, and ...
The development of an information extraction (IE) system for Thai documents raises a number of issue...
This study is to develop a word segmentation algorithm and solution for Myanmar language. This is a ...
The Thai written language is one of the languages that does not have word boundaries. In order to di...
Word segmentation is a problem in several Asian languages that have no explicit word boundary delimi...
In Thai language, the word boundary is not explicitly clear, therefore, word segmentation is needed ...
AbstractWord segmentation is the first step to process language that written in non-Latin letters su...
A Thai written text is a string of symbols without explicit word boundary markup. A method for a dev...
Since Thai writing system has no explicit word and sentence boundaries, language sense in Thai depen...
Unlike English, there is no explicit sentence marker in Thai language. Conventionally, a space is pl...
Abstract. Word segmentation is an important task in natural language processing, especially for lang...
This study reports the development of a Myanmar word segmentation method using Unicode standard enco...
The fact that words are not conventionally demarcated in Chinese orthography makes the process of wo...
Myanmar texts are different fromEnglish texts in that they have no spaces tomark the boundaries of w...
Word segmentation is a basic task and animportant problem in natural language processing. InMyanmar ...
This paper presents an application of two machine learning algorithms, i.e., Winnow and RIPPER, and ...
The development of an information extraction (IE) system for Thai documents raises a number of issue...
This study is to develop a word segmentation algorithm and solution for Myanmar language. This is a ...
The Thai written language is one of the languages that does not have word boundaries. In order to di...
Word segmentation is a problem in several Asian languages that have no explicit word boundary delimi...
In Thai language, the word boundary is not explicitly clear, therefore, word segmentation is needed ...
AbstractWord segmentation is the first step to process language that written in non-Latin letters su...
A Thai written text is a string of symbols without explicit word boundary markup. A method for a dev...
Since Thai writing system has no explicit word and sentence boundaries, language sense in Thai depen...
Unlike English, there is no explicit sentence marker in Thai language. Conventionally, a space is pl...
Abstract. Word segmentation is an important task in natural language processing, especially for lang...
This study reports the development of a Myanmar word segmentation method using Unicode standard enco...
The fact that words are not conventionally demarcated in Chinese orthography makes the process of wo...
Myanmar texts are different fromEnglish texts in that they have no spaces tomark the boundaries of w...
Word segmentation is a basic task and animportant problem in natural language processing. InMyanmar ...
This paper presents an application of two machine learning algorithms, i.e., Winnow and RIPPER, and ...
The development of an information extraction (IE) system for Thai documents raises a number of issue...
This study is to develop a word segmentation algorithm and solution for Myanmar language. This is a ...