Code reuse has the benefits of saving time and resources but poses a risk when attempting to tailor copied code for a new purpose or in cases when such copies are buggy or otherwise faulty. In the field of data science, the web application Jupyter Notebook is a popular tool for creating computational notebooks, documents containing both plain text and code snippets, many of which are publicly available on code hosting sites such as GitHub. This thesis describes the acquisition of approximately 2.6 million computational notebooks and the analysis of this data set. By hashing the contents of every code snippet with the MD5 hashing algorithm, cloned snippets were identified as those producing identical hashes. By subsequently mapping the snippet...
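The hash-based clone detection described in the abstract above can be illustrated with a minimal Python sketch. It is not the thesis's actual implementation: the function names `snippet_hash` and `find_clones`, the input layout (a dict mapping a notebook identifier to a list of code-snippet strings), and the choice to hash the raw UTF-8 text without any normalization are all assumptions made for illustration.

```python
import hashlib
from collections import defaultdict


def snippet_hash(source: str) -> str:
    """Return the MD5 hex digest of a snippet's raw text (no normalization, by assumption)."""
    return hashlib.md5(source.encode("utf-8")).hexdigest()


def find_clones(notebooks: dict[str, list[str]]) -> dict[str, list[tuple[str, int]]]:
    """Group snippets by hash and keep only hashes that occur more than once.

    `notebooks` is a hypothetical structure: notebook id -> list of code snippets.
    The result maps each shared hash to the (notebook id, snippet index) pairs
    where that snippet appears.
    """
    locations_by_hash = defaultdict(list)
    for notebook_id, snippets in notebooks.items():
        for index, source in enumerate(snippets):
            locations_by_hash[snippet_hash(source)].append((notebook_id, index))
    # A snippet counts as cloned when at least two locations share its hash.
    return {h: locs for h, locs in locations_by_hash.items() if len(locs) > 1}


if __name__ == "__main__":
    example = {
        "notebook_a": ["import numpy as np", "x = 1"],
        "notebook_b": ["import numpy as np", "y = 2"],
    }
    for digest, locations in find_clones(example).items():
        print(digest, locations)
```

Because the digest is computed over the exact text, this sketch only finds byte-identical snippets; a snippet that differs by a single space or comment would hash differently and go undetected.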
Similar fragments in source code are known as clones or duplicated code. One major issue with dupl...
Code Clones, also known as Software Clones, are similar code fragments mostly formed due to reuse of ...
Code clones have received great interest in recent years from many researchers, engineers, and prac...
Jupyter notebooks have emerged as a standard tool for data science programming. Programs in Jupyter n...
This is a replication package for the paper: "Code Duplication and Reuse in Jupyter Notebooks", whic...
By bringing together code, text, and examples, Jupyter notebooks have become one of the most popular...
Background: Software Code Cloning is widely used by developers to produce code in which they have co...
In scientific computing and data science, computer programs employing mathematical and statistical m...
Jupyter notebooks (documents that contain live code, equations, visualizations, and narrative text) no...
The self-documenting aspects and the ability to reproduce results have been touted as significant be...
This work delves into the field of code clone detection, focusing on the Python programming language. We...
This repo hosts the slides to the JupyterCon 2023 session "Computational reproducibility of Jupyter ...
Copying a code fragment and reusing it by pasting with or without minor modifications is a common pr...