We study the evolution of the largest known corpus of publicly available source code, i.e., the Software Heritage archive (4B unique source code files and 1B commits capturing their development histories across 50M software projects). On this corpus we quantify the growth rate of original, never-seen-before source code files and commits. We find the growth rates to be exponential over a period of more than 40 years. We then estimate the multiplication factor, i.e., how often the same artifacts (e.g., files or commits) appear in different contexts (e.g., commits or source code distribution places). We observe a combinatorial explosion in the multiplication of identical source code files across different commits. We discuss the implication of these...
Code clone genealogies show how clone groups evolve with the evolution of the associated software sy...
Assertions about quality, reliability, or trustworthiness of software systems are important for many...
The concept of provenance is well known by some, and completely unknown to others. In this talk I gi...
We study the possibilities to track provenance of software source code artifac...
Software Heritage is the largest existing public archive of software source co...
Software, and software source code in particular, is widely used in modern res...
Software Heritage is the largest existing public archive of software source code and accompanying de...
Software is now a key component present in all aspects of our society. Its pre...
Today’s software development processes are complex. A lot of interaction occurs between developers, ...
Among the entities involved in software citation, software source code require...
Software is becoming the fabric that binds our personal and social lives, embo...
Version Control Systems are key elements of modern software development. They ...
A lot of popular software is developed using a version control system. Historically, systems such as ...
Software repositories contain much information besides the source code itself. For Open Source proj...