Several program analysis tools, such as plagiarism detection and bug finding, rely on knowing a piece of code's relative semantic importance. For example, a plagiarism detector should not bother reporting two programs that have an identical simple loop counter test, but should report programs that share more distinctive code. Traditional program analysis techniques (e.g., finding data and control dependencies) are useful, but do not say how surprising or common a line of code is. Natural language processing researchers have encountered a similar problem and addressed it using an n-gram model of text frequency, derived from statistics computed over text corpora. We propose and compute an n-gram model for programming languages, computed ov...
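To make the idea of "how surprising or common a line of code is" concrete, the following is a minimal sketch of a token-level n-gram model. It is not the authors' implementation: the regular-expression tokenizer, the choice of trigrams, the add-one smoothing, and the names `tokenize`, `train`, and `surprisal` are all assumptions made for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical lexer: identifiers, integer literals, or any single
# non-whitespace character. A real tool would use the language's own lexer.
TOKEN_RE = re.compile(r"[A-Za-z_]\w*|\d+|\S")


def tokenize(line):
    """Split one line of code into a list of lexical tokens."""
    return TOKEN_RE.findall(line)


def pad(tokens, n):
    """Add start markers (n - 1 of them) and one end marker."""
    return ["<s>"] * (n - 1) + tokens + ["</s>"]


def train(corpus_lines, n=3):
    """Count n-grams and their (n-1)-token contexts over a corpus of lines."""
    ngrams, contexts, vocab = Counter(), Counter(), set()
    for line in corpus_lines:
        toks = pad(tokenize(line), n)
        vocab.update(toks)
        for i in range(n - 1, len(toks)):
            ctx = tuple(toks[i - n + 1:i])
            ngrams[ctx + (toks[i],)] += 1
            contexts[ctx] += 1
    return ngrams, contexts, len(vocab)


def surprisal(line, model, n=3):
    """Average bits per token of a line under the model (add-one smoothing).

    Higher values mean the line is rarer relative to the training corpus.
    """
    ngrams, contexts, vocab_size = model
    toks = pad(tokenize(line), n)
    total_bits = 0.0
    for i in range(n - 1, len(toks)):
        ctx = tuple(toks[i - n + 1:i])
        p = (ngrams[ctx + (toks[i],)] + 1) / (contexts[ctx] + vocab_size)
        total_bits -= math.log2(p)
    return total_bits / (len(toks) - (n - 1))
```

Under these assumptions, a line that recurs throughout the corpus (such as a simple loop header) receives a low score, while a line seen once or never receives a high one, which is the signal a plagiarism detector could use to weight matches. Add-one smoothing is the simplest choice and is used here only to keep the sketch short.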
Abstract. The 60-year-old dream of computational linguistics is to make computers capable of communi...
A frequently occurring usage of program elements in a programming language and software libraries i...
The simple access to texts on digital libraries and the WWW has led to an increased number of plagia...
Our work focuses on detecting plagiarism within a source code corpus. The case...
Along with the blossom of open source projects comes the convenience for software plagiarism. A comp...
The reliability of plagiarism detection systems, which try to identify similar programs in large pop...
Plagiarism is a serious problem among university students that needs to be addressed. In or...
Unauthorized re-use of code by students is a widespread problem in academic institutions, and raises...
We propose a detection method for plagiarised source code in programs written by students. The purp...
The pervasive connectivity of the Internet has contributed to increased productivity and plagiarism ...
Unigram is a fundamental element of n-gram in natural language processing. However, unigrams collect...
The act of source code plagiarism is an academic offense that discourages the learning habits of stu...
JPlag is a web service that finds pairs of similar programs among a given set of programs. It has su...
Plagiarism is a growing problem in academia. Academics often use plagiarism detection tools to detec...
Detecting code reuse in malicious software is complicated by the lack of source code. The same circ...