This paper presents the creation of a parallel corpus of Italian and German legal documents which are translations of one another. The corpus, which contains approximately 5 million words, is primarily intended as a resource for (semi-)automatic terminology acquisition. The guidelines of the Corpus Encoding Standard have been applied for encoding structural information, segmentation information, and sentence alignment. Since the parallel texts have a one-to-one correspondence on the sentence level, building a perfect sentence alignment is rather straightforward. As a result, the corpus also constitutes a valuable testbed for the evaluation of alignment algorithms. The paper discusses the intended use of the corpus, the various phases of...
In this paper we first give an overview of parallel corpus annotation, alignment and retrieval. We p...
Parallel corpora are of paramount importance for many applications of co... linguistics...
In this paper, we will introduce our project for enlarging the Italian-Russian parallel corpus of th...
The use of corpus linguistics for technical translations has been widely advocated by scholars over...
This paper focuses on the description of the corpus «PEST-INTER» in five languages and the process o...
In an increasingly globalized world, being able to understand texts in different languages (even mor...
Recent technological advances have led to an increase in the use of parallel corpus data among t...
Parallel corpora have long been recognized as an indispensable source of term candidates for buildin...
The aim of this paper is to present LEX.CH.IT, a corpus for micro-diachronic linguistic investigatio...
The study presents a corpus-based analysis of lexico-grammatical features of Italian legal lay-langu...
We present a new, unique and freely available parallel corpus containing European Union (EU) documen...
In this paper, we present a trilingual parallel corpus for German, Italian and Romansh, a Swiss mino...
The analysis of special multilingual corpora is still in its infancy, but it may serve a particularl...
In this paper we present our approach to extracting multi-word terms (MWTs) from an Italian-Arabic para...
The European Union institutions are increasingly present in the lives of European citizens, particul...