Code reuse has the benefits of saving time and resources but poses a risk whenattempting to tailor copied code for a new purpose or in cases when such copies arebuggy or otherwise faulty. In the field of data science, the web application JupyterNotebook is a popular tool for creating computational notebooks, documentscontaining both plain text and code snippets, many of which are publicly available oncode hosting sites such as GitHub. This thesis describes the acquisition ofapproximately 2.6 million computational notebooks and analysis of this data set.By hashing the contents of every code snippet, using the MD5 hashing algorithm,cloned snippets were found through snippets producing identical hashes. Bysubsequently mapping the snippets to t...
This repo hosts the slides to the JupyterCon 2023 session "Computational reproducibility of Jupyter ...
Despite substantial study over the past three decades resulting in the development of more than 250 ...
Copying a code fragment and reusing it by pasting with or without minor modifications is a common pr...
Code reuse has the benefits of saving time and resources but poses a risk whenattempting to tailor c...
Code reuse has the benefits of saving time and resources but poses a risk when attempting to tailor ...
Jupyter notebooks has emerged as a standard tool for data science programming. Programs in Jupyter n...
This is a replication package for the paper: "Code Duplication and Reuse in Jupyter Notebooks", whic...
By bringing together code, text, and examples, Jupyter notebooks have become one of the most popular...
Jupyter notebooks-documents that contain live code, equations, visualizations, and narrative text-no...
Background: Software Code Cloning is widely used by developers to produce code in which they have co...
This repository represents a replication package for our analysis on "Identifying Versions of Librar...
In scientific computing and data science, computer programs employing mathematical and statistical m...
The self-documenting aspects and the ability to reproduce results have been touted as significant be...
This work delves into the field of code clone detection, focusing on Python programming language. We...
Similar fragments in source codes are known as clones or duplicated codes. One major issue with dupl...
This repo hosts the slides to the JupyterCon 2023 session "Computational reproducibility of Jupyter ...
Despite substantial study over the past three decades resulting in the development of more than 250 ...
Copying a code fragment and reusing it by pasting with or without minor modifications is a common pr...
Code reuse has the benefits of saving time and resources but poses a risk whenattempting to tailor c...
Code reuse has the benefits of saving time and resources but poses a risk when attempting to tailor ...
Jupyter notebooks has emerged as a standard tool for data science programming. Programs in Jupyter n...
This is a replication package for the paper: "Code Duplication and Reuse in Jupyter Notebooks", whic...
By bringing together code, text, and examples, Jupyter notebooks have become one of the most popular...
Jupyter notebooks-documents that contain live code, equations, visualizations, and narrative text-no...
Background: Software Code Cloning is widely used by developers to produce code in which they have co...
This repository represents a replication package for our analysis on "Identifying Versions of Librar...
In scientific computing and data science, computer programs employing mathematical and statistical m...
The self-documenting aspects and the ability to reproduce results have been touted as significant be...
This work delves into the field of code clone detection, focusing on Python programming language. We...
Similar fragments in source codes are known as clones or duplicated codes. One major issue with dupl...
This repo hosts the slides to the JupyterCon 2023 session "Computational reproducibility of Jupyter ...
Despite substantial study over the past three decades resulting in the development of more than 250 ...
Copying a code fragment and reusing it by pasting with or without minor modifications is a common pr...