Chinese-English parallel corpora are key resources for Chinese-English cross-language information processing, Chinese-English bilingual lexicography, and Chinese-English language research and teaching. However, no large-scale Chinese-English corpus is available yet, given the difficulty and the intensive labour required to build one. In this paper, we present our work towards building a large-scale Chinese-English parallel corpus. We elaborate on the collection, annotation and mark-up of the parallel Chinese-English texts and on the workflow that we used to construct the corpus. We also present our work toward building tools that make it easy to construct and use the corpus for different purposes. Among these tools, a parallel concordance t...
Although there are increasing and significant ties between China and Portuguese-speaking countries, ...
In an increasingly globalized world, being able to understand texts in different languages (even mor...
We discuss some of the issues in producing sense-tagged parallel corpora: including pre-processing...
This paper describes the construction of a large-scale (over 500,000 sentence pairs) Chinese-Englis...
A parallel corpus is a valuable resource for cross-language information retrieval and data-driven natu...
This paper first describes an experiment to construct an English-Chinese parallel corpus, then apply...
We are constructing a Japanese-Chinese parallel corpus, which is a part of the NICT Multilingual Cor...
Most Chinese-English parallel corpora consist of English source texts translated into Chinese. This ...
In order to provide sufficient training data for statistical machine(-aided) translation in ...
information in languages other than English has grown significantly in recent years. This highlights...
The translation quality of Neural Machine Translation (NMT) systems depends strongly on the training...
We report experimental results on automatic extraction of an English-Chinese translation lexicon, by...
Parallel corpora are a crucial resource in research fields such as cross-lingual information retrie...