Distributional Similarity has attracted considerable attention in the field of natural language processing as an automatic means of countering the ubiquitous problem of sparse data. As a logographic language, Chinese words consist of characters and each of them is composed of one or more radicals. The meanings of characters are usually highly related to the words which contain them. Likewise, radicals often make a predictable contribution to the meaning of a character: characters that have the same components tend to have similar or related meanings. In this paper, we utilize these properties of the Chinese language to improve Chinese word similarity computation. Given a content word, we first extract similar words based on a large corpus a...
Posters Session 3 - abstract no. PS3:10Two experiments investigated whether knowledge of semantic ra...
Distributed word representations are very useful for capturing semantic information and have been su...
We show that the Zipf’s law for Chinese characters perfectly holds for sufficiently short ...
Distributional Similarity has attracted considerable attention in the field of natural language proc...
In the Chinese language, words consist of characters each of which is composed of one or more compon...
© 2017 Institute of Information Science. All Rights Reserved. Automatically detecting similar Chines...
Automatically identifying Chinese characters that are similar in their glyph, pronunciations and mea...
Information about students ’ mistakes opens a window to an understanding of their learning processes...
In this paper we propose a novel word representation for Chinese based on a state-of-the-art word em...
So far, most Chinese natural language processing neglects the punctuations or oversimplifies their f...
The research reported investigates word recognition in Chinese. A one-character Chinese word is comp...
The concepts of consistency and regularity characterize the orthography-to-phonology mappings of wri...
With about 90% of all characters in a Chinese dictionary belonging to the semantic-phonetic compound...
Little research has been done about the neural substrate of the sublexical level of Chinese word rec...
Poster Session B: ReadiNg & WritiNgIntroduction: About 34% of 4-letter words in English and French ...
Posters Session 3 - abstract no. PS3:10Two experiments investigated whether knowledge of semantic ra...
Distributed word representations are very useful for capturing semantic information and have been su...
We show that the Zipf’s law for Chinese characters perfectly holds for sufficiently short ...
Distributional Similarity has attracted considerable attention in the field of natural language proc...
In the Chinese language, words consist of characters each of which is composed of one or more compon...
© 2017 Institute of Information Science. All Rights Reserved. Automatically detecting similar Chines...
Automatically identifying Chinese characters that are similar in their glyph, pronunciations and mea...
Information about students ’ mistakes opens a window to an understanding of their learning processes...
In this paper we propose a novel word representation for Chinese based on a state-of-the-art word em...
So far, most Chinese natural language processing neglects the punctuations or oversimplifies their f...
The research reported investigates word recognition in Chinese. A one-character Chinese word is comp...
The concepts of consistency and regularity characterize the orthography-to-phonology mappings of wri...
With about 90% of all characters in a Chinese dictionary belonging to the semantic-phonetic compound...
Little research has been done about the neural substrate of the sublexical level of Chinese word rec...
Poster Session B: ReadiNg & WritiNgIntroduction: About 34% of 4-letter words in English and French ...
Posters Session 3 - abstract no. PS3:10Two experiments investigated whether knowledge of semantic ra...
Distributed word representations are very useful for capturing semantic information and have been su...
We show that the Zipf’s law for Chinese characters perfectly holds for sufficiently short ...