This dataset contains the repository data used for our study "A Large-Scale Study of Modern Code Review and Security in Open Source Projects". This dataset was collected from GitHub, and includes 3,126 projects in 143 languages, with 489,038 issues and 382,771 pull requests. We also include the regression analysis notebooks for reproducing our results from this data.Our main analysis (as reported in our paper) is contained in the Jupyter notebook `Regression.ipynb`. To run it, you need an active Jupyter instance running with the R kernel. We ran these analyses using R version 3.3.1 and the `ggplot2`, `reshape2`, `plyr`, `car`, `tibble`, and `ggfortify` packages. Full system details are available at the bottom of each notebook.Additionally,...
<div>This dataset contains source code reviews of 51 projects mined from Gerrit (14 projects, ~133K ...
Handouts of the following technical briefing. Georgios Gousios and Diomidis Spinellis. Mining softw...
Research software has opened up new pathways of discovery in many and diverse disciplines. The resea...
Modern code review is a lightweight and informal process for integrating changes into a software pro...
Research software is vital for academia, yet reliable figures are rare. In an attempt to better unde...
With over 10 million git repositories, GitHub is becoming one of the most important source of softwa...
Both datasets are part of a master's thesis at Carleton University, Canada. The first dataset, PR C...
GitHub currently hosts more than 100 million public repositories. This has made it very popular to c...
Phabricator is a modern code collaboration tool used by popular projects like FreeBSD and Mozilla. H...
Code review is an important practice that improves the overall quality of a proposed patch (i.e. cod...
In recent years, GitHub has become the largest code host in the world, with more than 5M developers ...
The quality of code can be measured using source code metrics. Looking at the trends of these metric...
GitHub is arguably the most influential OSS version control system currently available. It is utiliz...
This dataset, which is composed of two parts: the code review data collected from OpenStack and Qt, ...
The popularity of the software repository site GitHub has created a rise in the Pull Based Developme...
<div>This dataset contains source code reviews of 51 projects mined from Gerrit (14 projects, ~133K ...
Handouts of the following technical briefing. Georgios Gousios and Diomidis Spinellis. Mining softw...
Research software has opened up new pathways of discovery in many and diverse disciplines. The resea...
Modern code review is a lightweight and informal process for integrating changes into a software pro...
Research software is vital for academia, yet reliable figures are rare. In an attempt to better unde...
With over 10 million git repositories, GitHub is becoming one of the most important source of softwa...
Both datasets are part of a master's thesis at Carleton University, Canada. The first dataset, PR C...
GitHub currently hosts more than 100 million public repositories. This has made it very popular to c...
Phabricator is a modern code collaboration tool used by popular projects like FreeBSD and Mozilla. H...
Code review is an important practice that improves the overall quality of a proposed patch (i.e. cod...
In recent years, GitHub has become the largest code host in the world, with more than 5M developers ...
The quality of code can be measured using source code metrics. Looking at the trends of these metric...
GitHub is arguably the most influential OSS version control system currently available. It is utiliz...
This dataset, which is composed of two parts: the code review data collected from OpenStack and Qt, ...
The popularity of the software repository site GitHub has created a rise in the Pull Based Developme...
<div>This dataset contains source code reviews of 51 projects mined from Gerrit (14 projects, ~133K ...
Handouts of the following technical briefing. Georgios Gousios and Diomidis Spinellis. Mining softw...
Research software has opened up new pathways of discovery in many and diverse disciplines. The resea...