This thesis develops a robust inventory of large-scale lattice rescoring methods that improve the quality of statistical machine translation. These rescoring methods include (i) sentence-specific, high-order language models estimated over multi-billion word corpora, (ii) stochastic segmentation transducers that model the phrasal segmentation process in phrase-based SMT, (iii) efficient large-scale lattice minimum Bayes-risk decoding procedures based on weighted path counting transducers, (iv) multi-input and multi-source lattice combination techniques that synthesise multiple sources of translation knowledge, and (v) a novel decoding framework based on segmentation of a word lattice into regions of high and low confidence that supports targ...
© 2014 Association for Computational Linguistics. The combinatorial space of translation derivations...
Minimum Error Rate Training (MERT) and Minimum Bayes-Risk (MBR) decod-ing are used in most current s...
Statistical machine translation relies heavily on parallel corpora to train its models for transla...
Lattice decoding in statistical machine translation (SMT) is useful in speech translation and in the...
Morphological segmentation is an effec-tive sparsity reduction strategy for statis-tical machine tra...
Simple and Efficient Model Filtering in Statistical Machine Translation Data availability and distri...
This paper presents MISTRAL, an open source statistical machine translation decoder dedicated to spo...
Abstract Data availability and distributed computing techniques have allowed statistical machine tra...
We propose a novel statistical machine translation de-coding algorithm for speech translation to imp...
Multi-source statistical machine transla-tion is the process of generating a single translation from...
Statistical machine translation (SMT) requires a parallel corpus between the source and target langu...
Statistical machine translation is based on the idea to extract information from bilingual corpora, ...
In the last decade, while statistical machine translation has advanced significantly, there is still...
In this paper we describe the statistical machine transla-tion system developed at ITI/UPV, which ai...
2014-07-28The goal of machine translation is to translate from one natural language into another usi...
© 2014 Association for Computational Linguistics. The combinatorial space of translation derivations...
Minimum Error Rate Training (MERT) and Minimum Bayes-Risk (MBR) decod-ing are used in most current s...
Statistical machine translation relies heavily on parallel corpora to train its models for transla...
Lattice decoding in statistical machine translation (SMT) is useful in speech translation and in the...
Morphological segmentation is an effec-tive sparsity reduction strategy for statis-tical machine tra...
Simple and Efficient Model Filtering in Statistical Machine Translation Data availability and distri...
This paper presents MISTRAL, an open source statistical machine translation decoder dedicated to spo...
Abstract Data availability and distributed computing techniques have allowed statistical machine tra...
We propose a novel statistical machine translation de-coding algorithm for speech translation to imp...
Multi-source statistical machine transla-tion is the process of generating a single translation from...
Statistical machine translation (SMT) requires a parallel corpus between the source and target langu...
Statistical machine translation is based on the idea to extract information from bilingual corpora, ...
In the last decade, while statistical machine translation has advanced significantly, there is still...
In this paper we describe the statistical machine transla-tion system developed at ITI/UPV, which ai...
2014-07-28The goal of machine translation is to translate from one natural language into another usi...
© 2014 Association for Computational Linguistics. The combinatorial space of translation derivations...
Minimum Error Rate Training (MERT) and Minimum Bayes-Risk (MBR) decod-ing are used in most current s...
Statistical machine translation relies heavily on parallel corpora to train its models for transla...