The growth of textual content in various languages and the advancement of automatic translation systems has led to an increase of cases of translated plagiarism. When a text is translated into another language, word order will change and words may be substituted by synonyms, and as a result detection will be more challenging. The purpose of this paper is to introduce a new technique for English-Arabic cross-language plagiarism detection. This method combines word embedding, term weighting techniques, and universal sentence encoder models, in order to improve detection of sentence similarity. The proposed model has been evaluated based on English-Arabic cross-lingual datasets, and experimental results show improved performance when compared ...
International audienceWe present our submitted systems for Semantic Textual Similarity (STS) Track 4...
We propose and experimentally assess Semantic Word Error Rate (SWER), an innovative similarity measu...
Cross-language plagiarism detection deals with the automatic identification and extraction of plagia...
The advancement of the web and information technology has contributed to the rapid growth of digital...
International audienceThis paper proposes to use distributed representation of words (word embedding...
Three reasons make plagiarism across languages to be on the rise: (i) speakers of under-resourced la...
International audienceMeasuring the amount of shared information between two documents is a key to a...
Abstract — Many language-sensitive tools for detecting plagiarism in natural language documents have...
Plagiarism, the unacknowledged reuse of text, does not end at language boundaries. Cross-language pl...
this data collected for research in Arabic-English Cross-Lingual Plagiarism Detectio
With the advent of the Internet and the widespread use of digital documents, access to information f...
Cross-lingual plagiarism occurs when the source (or original) text(s) is in one language and the pla...
Generally utterances in natural language are highly ambiguous, and a unique interpretation can usual...
none4siCross-language plagiarism detection deals with the automatic identification and extraction of...
Part 1: Information and Communication Technology- Eurasia Conference (ICT-EurAsia)International audi...
International audienceWe present our submitted systems for Semantic Textual Similarity (STS) Track 4...
We propose and experimentally assess Semantic Word Error Rate (SWER), an innovative similarity measu...
Cross-language plagiarism detection deals with the automatic identification and extraction of plagia...
The advancement of the web and information technology has contributed to the rapid growth of digital...
International audienceThis paper proposes to use distributed representation of words (word embedding...
Three reasons make plagiarism across languages to be on the rise: (i) speakers of under-resourced la...
International audienceMeasuring the amount of shared information between two documents is a key to a...
Abstract — Many language-sensitive tools for detecting plagiarism in natural language documents have...
Plagiarism, the unacknowledged reuse of text, does not end at language boundaries. Cross-language pl...
this data collected for research in Arabic-English Cross-Lingual Plagiarism Detectio
With the advent of the Internet and the widespread use of digital documents, access to information f...
Cross-lingual plagiarism occurs when the source (or original) text(s) is in one language and the pla...
Generally utterances in natural language are highly ambiguous, and a unique interpretation can usual...
none4siCross-language plagiarism detection deals with the automatic identification and extraction of...
Part 1: Information and Communication Technology- Eurasia Conference (ICT-EurAsia)International audi...
International audienceWe present our submitted systems for Semantic Textual Similarity (STS) Track 4...
We propose and experimentally assess Semantic Word Error Rate (SWER), an innovative similarity measu...
Cross-language plagiarism detection deals with the automatic identification and extraction of plagia...