Achieving accurate translation, especially in multiple domain documents with statistical machine translation systems, requires more and more bilingual texts and this need becomes more critical when training such systems for language pairs with scarce training data. In the recent years, there have been some researches on new sources of parallel texts that are documents which are not necessarily parallel but are comparable. Since these methods search for possible translation equivalences in a greedy manner, they are unable to consider all possible parallel texts in comparable documents. This paper investigates a different approach for this need by considering relationships between all words of two comparable documents, which works fairly well...
In statistical machine translation, large numbers of parallel sentences are required to train the ...
Parallel corpora are crucial for statistical machine translation (SMT); however, they are quite scar...
International audienceIn this article, we present a simple and effective approach for extracting bil...
Although parallel sentences rarely exist in quasi–comparable corpora, there could be parallel fragme...
This paper proposes a novel method for exploiting comparable documents to generate parallel data fo...
AbstractParallel sentences are a relatively scarce but extremely useful resource for many applicatio...
In statistical machine translation, large numbers of parallel sentences are required to train the mo...
We explore the usability of different bilingual corpora for the purpose of multilingual and cross-li...
Parallel text is one of the most valuable resources for development of statistical machine translati...
Abstract. Parallel corpora are playing a crucial role in multilingual natural language processing. U...
In this work we present an approach for extracting parallel phrases from comparable news articles to...
cfl Springer-Verlag Abstract. Most methods to extract bilingual lexicons from parallel corpora learn...
Building parallel resources for corpus based machine translation, especially Statistical Machine Tra...
We present a novel paraphrase fragment pair extraction method that uses a monolingual comparable cor...
Abstract. This paper presents a new learning method for automatic acquisition of translation knowled...
In statistical machine translation, large numbers of parallel sentences are required to train the ...
Parallel corpora are crucial for statistical machine translation (SMT); however, they are quite scar...
International audienceIn this article, we present a simple and effective approach for extracting bil...
Although parallel sentences rarely exist in quasi–comparable corpora, there could be parallel fragme...
This paper proposes a novel method for exploiting comparable documents to generate parallel data fo...
AbstractParallel sentences are a relatively scarce but extremely useful resource for many applicatio...
In statistical machine translation, large numbers of parallel sentences are required to train the mo...
We explore the usability of different bilingual corpora for the purpose of multilingual and cross-li...
Parallel text is one of the most valuable resources for development of statistical machine translati...
Abstract. Parallel corpora are playing a crucial role in multilingual natural language processing. U...
In this work we present an approach for extracting parallel phrases from comparable news articles to...
cfl Springer-Verlag Abstract. Most methods to extract bilingual lexicons from parallel corpora learn...
Building parallel resources for corpus based machine translation, especially Statistical Machine Tra...
We present a novel paraphrase fragment pair extraction method that uses a monolingual comparable cor...
Abstract. This paper presents a new learning method for automatic acquisition of translation knowled...
In statistical machine translation, large numbers of parallel sentences are required to train the ...
Parallel corpora are crucial for statistical machine translation (SMT); however, they are quite scar...
International audienceIn this article, we present a simple and effective approach for extracting bil...