ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction This archive contains the ApacheJIT dataset presented in the paper "ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction" as well as the replication package. The paper is submitted to MSR 2022 Data Showcase Track. The datasets are available under directory dataset. There are 4 datasets in this directory. apachejit_total.csv: This file contains the entire dataset. Commits are specified by their identifier and a set of commit metrics that are explained in the paper are provided as features. Column buggy specifies whether or not the commit introduced any bug into the system. apachejit_train.csv: This file is a subset of the entire dataset. It provides a balanced set t...
A total of 22 software defect datasets with format ARFF. AEEEM was collected by D’Ambros et al. [1]...
This is the dataset for the publication "On the differences between quality increasing and other cha...
This is the replication package for the paper "AI-based Fault-proneness Metrics for Source Code Chan...
ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction This archive contains the ApacheJIT d...
We have meticulously prepared this comprehensive replication package to facilitate further investiga...
Defect prediction has been a major research problem in the software engineering domain for the last ...
Previous researchers conducting Just-In-Time (JIT) defect prediction tasks have primarily focused on...
This is a collection of defect datasets for the software engineering research community. This collec...
Finding defects in proposed changes is one of the biggest motivations and expected outcomes of code ...
Dataset used for paper "Issues-Driven Features for Software Fault Prediction". The dataset...
Context: Defect prediction research is based on a small number of defect datasets and most are at cl...
The number of research papers on defect prediction has sharply increased for the last decade or so. ...
This is the replication package for our article "Problems with SZZ and Features: An empirical study ...
Note: Please find the dockerized version of this replication package in the following link: https:/...
Software quality assurance efforts often focus on identifying defective code. To find likely defecti...
A total of 22 software defect datasets with format ARFF. AEEEM was collected by D’Ambros et al. [1]...
This is the dataset for the publication "On the differences between quality increasing and other cha...
This is the replication package for the paper "AI-based Fault-proneness Metrics for Source Code Chan...
ApacheJIT: A Large Dataset for Just-In-Time Defect Prediction This archive contains the ApacheJIT d...
We have meticulously prepared this comprehensive replication package to facilitate further investiga...
Defect prediction has been a major research problem in the software engineering domain for the last ...
Previous researchers conducting Just-In-Time (JIT) defect prediction tasks have primarily focused on...
This is a collection of defect datasets for the software engineering research community. This collec...
Finding defects in proposed changes is one of the biggest motivations and expected outcomes of code ...
Dataset used for paper "Issues-Driven Features for Software Fault Prediction". The dataset...
Context: Defect prediction research is based on a small number of defect datasets and most are at cl...
The number of research papers on defect prediction has sharply increased for the last decade or so. ...
This is the replication package for our article "Problems with SZZ and Features: An empirical study ...
Note: Please find the dockerized version of this replication package in the following link: https:/...
Software quality assurance efforts often focus on identifying defective code. To find likely defecti...
A total of 22 software defect datasets with format ARFF. AEEEM was collected by D’Ambros et al. [1]...
This is the dataset for the publication "On the differences between quality increasing and other cha...
This is the replication package for the paper "AI-based Fault-proneness Metrics for Source Code Chan...