Most semi-supervised methods in Natural Language Process-ing capitalize on unannotated resources in a single language; however, information can be gained from using parallel re-sources in more than one language, since translations of the same utterance in different languages can help to disam-biguate each other. We demonstrate a method that makes effective use of vast amounts of bilingual text (a.k.a. bi-text) to improve monolingual systems. We propose a factored probabilistic sequence model that encourages both cross-language and intra-document consistency. A simple Gibbs sampling algorithm is introduced for performing approximate inference. Experiments on English-Chinese Named Entity Recognition (NER) using the OntoNotes dataset demonstra...
We investigate the task of unsupervised constituency parsing from bilingual parallel corpora. Our go...
We propose a novel statistical translation model to improve translation selection of collocation. In...
Traditional approaches to supervised learning require a generous amount of labeled data for good gen...
Most semi-supervised methods in Natural Language Processing capitalize on unannotated resources in a...
Supervised learning systems require a large quantity of labeled data, which is time-consuming, expen...
For languages with no annotated resources, transferring knowledge from rich-resource languages is an...
Recently, neural methods have achieved state-of-the-art (SOTA) results in Named Entity Recognition (...
Abstract. We present a named-entity recognition (NER) system for parallel multilingual text. Our sys...
Building named entity recognition (NER) models for languages that do not have much training data is ...
Translation of named entities (NE), including proper names, temporal and numerical expressions, is v...
This paper presents a bilingual semi-supervised Chinese word segmentation (CWS) method that leverage...
Abstract. Bilingual Named Entity Extraction is important to some cross language information processe...
We propose a simple yet effective approach to learning bilingual word embeddings (BWEs) from non-par...
Abstract. Bilingual Named Entity Extraction is important to some cross language information processe...
In this paper, we describe a technique to improve named entity recognition in a resource-poor langua...
We investigate the task of unsupervised constituency parsing from bilingual parallel corpora. Our go...
We propose a novel statistical translation model to improve translation selection of collocation. In...
Traditional approaches to supervised learning require a generous amount of labeled data for good gen...
Most semi-supervised methods in Natural Language Processing capitalize on unannotated resources in a...
Supervised learning systems require a large quantity of labeled data, which is time-consuming, expen...
For languages with no annotated resources, transferring knowledge from rich-resource languages is an...
Recently, neural methods have achieved state-of-the-art (SOTA) results in Named Entity Recognition (...
Abstract. We present a named-entity recognition (NER) system for parallel multilingual text. Our sys...
Building named entity recognition (NER) models for languages that do not have much training data is ...
Translation of named entities (NE), including proper names, temporal and numerical expressions, is v...
This paper presents a bilingual semi-supervised Chinese word segmentation (CWS) method that leverage...
Abstract. Bilingual Named Entity Extraction is important to some cross language information processe...
We propose a simple yet effective approach to learning bilingual word embeddings (BWEs) from non-par...
Abstract. Bilingual Named Entity Extraction is important to some cross language information processe...
In this paper, we describe a technique to improve named entity recognition in a resource-poor langua...
We investigate the task of unsupervised constituency parsing from bilingual parallel corpora. Our go...
We propose a novel statistical translation model to improve translation selection of collocation. In...
Traditional approaches to supervised learning require a generous amount of labeled data for good gen...