We address the problem of automatically cleaning a large-scale Translation Memory (TM) in a fully unsupervised fashion, i.e. without human-labelled data. We approach the task by: i) designing a set of features that capture the similarity between two text segments in different languages, ii) use them to induce reliable training labels for a subset of the translation units (TUs) contained in the TM, and iii) use the automatically labelled data to train an ensemble of binary classifiers. We apply our method to clean a test set composed of 1,000 TUs randomly extracted from the English-Italian version of MyMemory, the world’s largest public TM. Our results show competitive performance not only against a strong baseline that exp...
This paper explores the use of machine translation (MT) to help users of computer-aided translation ...
Matching and retrieving previously translated segments from the Translation Memory is a key function...
�� 2020 ACL. This is an open access article available under a Creative Commons licence. The publishe...
We present TMop, the first open-source tool for automatic Translation Memory (TM) cleaning. The to...
We address the problem of automatically cleaning a translation memory (TM) by identifying problemati...
This paper summarizes the work done to prepare the first shared task on automatic tran...
This is an accepted manuscript of an article published by Springer in Machine Translation on 21/01/2...
This paper explores the use of general-purpose machine translation (MT) in assisting the users of co...
In this paper we present a novel way of integrating Translation Memory into an Example-based Machine...
In this paper we address the problem of automatic acquisition of a human-oriented translation dictio...
In this paper, we propose a novel frame- work to enrich Translation Memory (TM) systems with Statist...
We propose the design, implementation and evaluation of a novel and modular approach to boost the tr...
Translation Memory (TM) systems are one of the most widely used translation technologies. An importa...
Translation Memory (TM) systems are one of the most widely used translation technologies. An importa...
We propose a translation recommendation framework to integrate Statistical Machine Translation (SMT)...
This paper explores the use of machine translation (MT) to help users of computer-aided translation ...
Matching and retrieving previously translated segments from the Translation Memory is a key function...
�� 2020 ACL. This is an open access article available under a Creative Commons licence. The publishe...
We present TMop, the first open-source tool for automatic Translation Memory (TM) cleaning. The to...
We address the problem of automatically cleaning a translation memory (TM) by identifying problemati...
This paper summarizes the work done to prepare the first shared task on automatic tran...
This is an accepted manuscript of an article published by Springer in Machine Translation on 21/01/2...
This paper explores the use of general-purpose machine translation (MT) in assisting the users of co...
In this paper we present a novel way of integrating Translation Memory into an Example-based Machine...
In this paper we address the problem of automatic acquisition of a human-oriented translation dictio...
In this paper, we propose a novel frame- work to enrich Translation Memory (TM) systems with Statist...
We propose the design, implementation and evaluation of a novel and modular approach to boost the tr...
Translation Memory (TM) systems are one of the most widely used translation technologies. An importa...
Translation Memory (TM) systems are one of the most widely used translation technologies. An importa...
We propose a translation recommendation framework to integrate Statistical Machine Translation (SMT)...
This paper explores the use of machine translation (MT) to help users of computer-aided translation ...
Matching and retrieving previously translated segments from the Translation Memory is a key function...
�� 2020 ACL. This is an open access article available under a Creative Commons licence. The publishe...