In this paper we investigate the potential benefits of Latent Dirichlet Allocation (LDA) as a technique for code clone de-tection. Our objective is to propose a language-independent, effective, and scalable approach for identifying similar code fragments in relatively large software systems. The main assumption is that the latent topic structure of software ar-tifacts gives an indication of the presence of code clones. In particular, we hypothesize that artifacts with similar topic distributions contain duplicated code fragments. To test this novel hypothesis, we conduct an experimental investigation using multiple datasets from difierent application domains. Preliminary results show that, if calibrated properly, topic modeling can deliver ...
Similar fragments in source codes are known as clones or duplicated codes. One major issue with dupl...
Abstract Reusing existing software with or without modications is frequently occurred to develop new...
Maintainability is an important attribute when developing software. One of the factors that negative...
In this thesis work, the potential benefits of Latent Dirichlet Allocation (LDA) as a technique for ...
Copying a code fragment and reusing the same in several parts of the project is a common phenomenon ...
Duplication is detected by comparing features of source fragments. The main problem for the detectio...
Code clone is a portion of codes that contains some similarities in the same software regardless of ...
Code duplication or copying a code fragment and then reuse by pasting with or without any modificati...
Several techniques have been developed for identifying similar code fragments in programs. These sim...
Abstract—Knowledge of similar code fragments, also known as code clones, is important to many softwa...
Despite the fact that duplicated fragments of code also called code clones are considered one of the...
Code clones are syntactically or semantically equivalent code fragments of source code. Copy-and-pas...
Abstract: This paper presents a clone-detection method/tool currently under devel-opment. This tool ...
Abstract- Inconsistent changes to code clones can create faults and, hence, lead to incorrect progra...
Abstract — Clone detection techniques essentially cluster textually, syntactically and/or semantical...
Similar fragments in source codes are known as clones or duplicated codes. One major issue with dupl...
Abstract Reusing existing software with or without modications is frequently occurred to develop new...
Maintainability is an important attribute when developing software. One of the factors that negative...
In this thesis work, the potential benefits of Latent Dirichlet Allocation (LDA) as a technique for ...
Copying a code fragment and reusing the same in several parts of the project is a common phenomenon ...
Duplication is detected by comparing features of source fragments. The main problem for the detectio...
Code clone is a portion of codes that contains some similarities in the same software regardless of ...
Code duplication or copying a code fragment and then reuse by pasting with or without any modificati...
Several techniques have been developed for identifying similar code fragments in programs. These sim...
Abstract—Knowledge of similar code fragments, also known as code clones, is important to many softwa...
Despite the fact that duplicated fragments of code also called code clones are considered one of the...
Code clones are syntactically or semantically equivalent code fragments of source code. Copy-and-pas...
Abstract: This paper presents a clone-detection method/tool currently under devel-opment. This tool ...
Abstract- Inconsistent changes to code clones can create faults and, hence, lead to incorrect progra...
Abstract — Clone detection techniques essentially cluster textually, syntactically and/or semantical...
Similar fragments in source codes are known as clones or duplicated codes. One major issue with dupl...
Abstract Reusing existing software with or without modications is frequently occurred to develop new...
Maintainability is an important attribute when developing software. One of the factors that negative...