Chinese characters have a complex, hierarchical graphical structure that carries both semantic and phonetic information. We use this structure to enhance the text model and obtain better results in standard NLP operations. First, to tackle the problem of graphical variation, we define allographic classes of characters. Next, the relation of inclusion of a subcharacter in a character provides us with a directed graph of allographic classes. We equip this graph with two weights, semanticity (the semantic relation between subcharacter and character) and phoneticity (the phonetic relation), and calculate the "most semantic subcharacter paths" for each character. Finally, adding the information contained in these paths to unigr...
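The abstract does not say how a "most semantic subcharacter path" is computed, so the following is only a minimal sketch under one possible reading: starting from a character, repeatedly follow the inclusion edge with the highest semanticity until a character with no recorded subcharacters is reached. The greedy rule, the function name `most_semantic_path`, the table `SUBCHARACTERS`, and all weights are assumptions made for illustration and are not taken from the paper.

```python
from typing import Dict, List, Tuple

# Hypothetical toy graph: character -> [(subcharacter, semanticity, phoneticity)].
# All weights are invented for illustration; they are not values from the paper.
Edge = Tuple[str, float, float]
SUBCHARACTERS: Dict[str, List[Edge]] = {
    "媽": [("女", 0.9, 0.0), ("馬", 0.1, 0.9)],
    "想": [("心", 0.8, 0.0), ("相", 0.2, 0.9)],
    "相": [("木", 0.3, 0.1), ("目", 0.6, 0.1)],
}


def most_semantic_path(char: str, graph: Dict[str, List[Edge]]) -> List[str]:
    """Greedily follow the highest-semanticity edge (an assumed rule)."""
    path = [char]
    seen = {char}
    while graph.get(char):
        # Pick the subcharacter whose semanticity weight is largest.
        best, _, _ = max(graph[char], key=lambda edge: edge[1])
        if best in seen:  # guard against accidental cycles in the data
            break
        path.append(best)
        seen.add(best)
        char = best
    return path


if __name__ == "__main__":
    for c in ("媽", "想", "相"):
        print(c, "->", " > ".join(most_semantic_path(c, SUBCHARACTERS)))
```

Under this reading, the characters on a path could be appended as extra features to each unigram. A different path criterion, such as maximizing the product of semanticity along the whole path, would require a search over paths rather than the greedy walk shown here.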
This paper describes an ongoing project concerned with an ontological lexical resource based on t...
Chinese characters have semantic-rich compositional information in radical form. While almost all pr...
The Chinese language is based on characters which are syllabic in nature. Since languages have sylla...
This thesis deals with Chinese characters (Hanzi): their key characteristics and how they could be u...
Designations have been used very inconsistently in deciphering the nature of the Chinese writing sys...
This thesis looks into the problem of learning Chinese characters for foreign language learners and ...
Chinese script is non-alphabetic and a Chinese graph is basically syllabic which may consist of phon...
The complexity of Chinese orthography has hindered the progress of research in Chinese to the same l...
This paper describes a system for handwritten Chinese text recognition integrating language model. O...
It has been shown through a number of experiments that neural networks can be used for a phonetic ty...
This paper presents a comprehensive comparison study of various learning-based approaches for Chines...
Learning a language such as Mandarin Chinese includes specific challenges. A c...
We propose a new goal for constructing a Chinese phoneme-to-character automatic conversion system. I...
Hill NW, List J-M. Using Chinese Character Formation Graphs to Test Proposals in Chinese Historical ...