This dataset was built by downloading the 500 most-starred Java projects from GitHub and then removing the projects that also appear in the java-large-training and java-large-testing raw datasets*. The resulting dataset consists of 155 GitHub projects. The repositories were downloaded and analyzed using the code in the following repository: https://github.com/serg-ml4se-2019/group5-deep-bugs/tree/master (more specifically, the code in the bug_mining folder).

* The java-large raw dataset can be found at https://github.com/tech-srl/code2seq/blob/master/README.md#dataset
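The overlap-removal step described above can be illustrated with a minimal sketch. It assumes projects are matched by repository name, case-insensitively; the actual matching rule used in the bug_mining code is not stated here, and the function names and sample inputs below are hypothetical.

```python
def repo_name(full_name: str) -> str:
    """Return the lowercase repository name from an 'owner/repo' string."""
    return full_name.rsplit("/", 1)[-1].lower()


def remove_overlap(candidates, excluded_names):
    """Keep candidates whose repository name is not in excluded_names.

    Assumption: a project counts as overlapping when its repository name
    matches an excluded name, ignoring case.
    """
    excluded = {name.lower() for name in excluded_names}
    return [c for c in candidates if repo_name(c) not in excluded]


if __name__ == "__main__":
    # Hypothetical inputs, not taken from the actual dataset.
    top_starred = ["example-org/Alpha", "someone/beta", "other/gamma"]
    java_large = ["alpha", "delta"]
    print(remove_overlap(top_starred, java_large))
```

Matching on the bare repository name is the simplest choice, but it can drop unrelated projects that happen to share a name; matching on the full `owner/repo` string would be stricter if that information is available for both sources.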
This data set will be released as part of the following publication. "Root cause prediction based on...
Repository mining of bug fixes from version control systems like GitHub is a challenging problem as ...
Examining software ecosystems can provide the research community with data regarding artifacts, proc...
This is the dataset for the study of Potential Code Borrowing and License Violations in Java Project...
A Public Unified Bug Dataset for Java and its Assessment Regarding Metrics and Bug Prediction. Onli...
The ManySStuBs4J corpus is a collection of simple fixes to Java bugs, designed for evaluating progra...
Dataset used for paper "Issues-Driven Features for Software Fault Prediction". The dataset...
This dataset contains bug reports, commit history, and API descriptions of six open source Java p...
One of the important aims of the continuous software development process is to localize and remove a...
The dataset comprises code changes made to 15 Java Open-Source projects, classified with sentiment v...
Proceedings of the 26th IEEE International Conference on Software Analysis, Evolution and Reengineer...
Identifying and minimizing the number of bugs before release is a high priority of any team working ...
For creating, optimizing, and evaluating our statistical model, we used the Public Unified Bug Datas...
About the Data: They download Herzig et al.’s datasets, which included the identifiers of issue reports...