Languages change over time, and ancient languages have been studied in linguistics and other related fields. A main challenge in this research area is the lack of empirical data; for instance, ancient spoken languages often leave little trace of their linguistic properties. From the perspective of natural language processing (NLP), while the NLP community has created dozens of annotated corpora, very few of them cover ancient languages. As an effort toward bridging the gap, we have created a word-segmented and POS-tagged corpus for Archaic Chinese using articles from Huainanzi, a book written during China’s Western Han Dynasty (206 BC-9 AD). We then compare this corpus with the Chinese Penn Treebank (CTB), a well-known corpus for Modern Chi...
By comparing the languages of the world, we gain invaluable insights into human prehistory, predatin...
The evolution of language follows the rule of gradual change. Grammar, vocabulary, and lexical seman...
Over the last few decades, the wide diffusion of digital technology and the growing ease of transfer...
This work is part of a broader project which requires adapting information ext...
The article provides brief information about the Chinese language and Chinese language communities (...
This monograph is a translation of two seminal works on corpus-based studies of Mandarin Chinese wor...
In this study, we compare statistical properties of ancient and modern Chines...
With growing interest in Chinese Language Processing, numerous NLP tools (e.g., word segmenters, par...
This paper introduces a program practice on Middle Chinese corpus for CLP and historical gr...
Phylogeny-based network approaches are a powerful tool to study language histo...
Can we use NLP to extract information about long-dead languages from secondary sources more than a m...