The development of models for automatic detection of text re-use and plagiarism across languages has received increasing attention in recent years. However, the lack of an evaluation framework composed of annotated datasets has caused these efforts to be isolated. In this paper we present the CL!TR 2011 corpus, the first manually created corpus for the analysis of cross-language text re-use between English and Hindi. The corpus was used during the Cross-Language !ndian Text Re-Use Detection Competition. Here we overview the approaches applied the contestants and evaluate their quality when detecting a re-used text together with its source
Internet has made available huge amounts of information, also source code. Source code repositories...
none4siCross-language plagiarism detection deals with the automatic identification and extraction of...
Cross-language plagiarism detection deals with the automatic identification and extraction of plagia...
The development of models for automatic detection of text re-use and plagiarism across languages has...
The development of models for automatic detection of text re-use and plagiarism across languages has...
Text reuse occurs when one borrows the text (either verbatim or paraphrased) from an earlier written...
Text reuse is becoming a serious issue in many fields and research shows that it is much harder to d...
Cross-lingual plagiarism occurs when the source (or original) text(s) is in one language and the pla...
The evaluation dataset for the cross-lingual text reuse detection task.The dataset was prepared for ...
Text reuse is the act of borrowing text (either verbatim or paraphrased) from an earlier written tex...
Text reuse is the act of borrowing text from existing documents to create new texts. Freely availabl...
Plagiarism, the unacknowledged reuse of text, does not end at language boundaries. Cross-language pl...
In recent years, the problem of Cross-Lingual Text Reuse Detection (CLTRD) has gained the interest o...
Three reasons make plagiarism across languages to be on the rise: (i) speakers of under-resourced la...
Nowadays, Internet is the main source to get information from blogs, encyclopedias, discussion forum...
Internet has made available huge amounts of information, also source code. Source code repositories...
none4siCross-language plagiarism detection deals with the automatic identification and extraction of...
Cross-language plagiarism detection deals with the automatic identification and extraction of plagia...
The development of models for automatic detection of text re-use and plagiarism across languages has...
The development of models for automatic detection of text re-use and plagiarism across languages has...
Text reuse occurs when one borrows the text (either verbatim or paraphrased) from an earlier written...
Text reuse is becoming a serious issue in many fields and research shows that it is much harder to d...
Cross-lingual plagiarism occurs when the source (or original) text(s) is in one language and the pla...
The evaluation dataset for the cross-lingual text reuse detection task.The dataset was prepared for ...
Text reuse is the act of borrowing text (either verbatim or paraphrased) from an earlier written tex...
Text reuse is the act of borrowing text from existing documents to create new texts. Freely availabl...
Plagiarism, the unacknowledged reuse of text, does not end at language boundaries. Cross-language pl...
In recent years, the problem of Cross-Lingual Text Reuse Detection (CLTRD) has gained the interest o...
Three reasons make plagiarism across languages to be on the rise: (i) speakers of under-resourced la...
Nowadays, Internet is the main source to get information from blogs, encyclopedias, discussion forum...
Internet has made available huge amounts of information, also source code. Source code repositories...
none4siCross-language plagiarism detection deals with the automatic identification and extraction of...
Cross-language plagiarism detection deals with the automatic identification and extraction of plagia...