Context The SZZ algorithm is the de facto standard for labeling bug fixing commits and finding inducing changes for defect prediction data. Recent research uncovered potential problems in different parts of the SZZ algorithm. Most defect prediction data sets provide only static code metrics as features, while research indicates that other features are also important. Objective We provide an empirical analysis of the defect labels created with the SZZ algorithm and the impact of commonly used features on results. Method We used a combination of manual validation and adopted or improved heuristics for the collection of defect data. We conducted an empirical study on 398 releases of 38 Apache projects. Results We found that only half of...
Many software defect prediction models have been built using historical defect data obtained by mini...
In defect prediction studies, open-source and real-world defect data sets are frequently used. The q...
During the last 10 years, hundreds of different defect prediction models have been published. The p...
Defect prediction aims at identifying software artifacts that are likely to exhibit a defect. The ma...
Two recent studies explicitly recommend labeling defective classes in releases using the affected ve...
Defect prediction models can be beneficial to prioritize testing, analysis, or code review activitie...
Data from software repositories are a very useful asset to building dierent kinds of models and reco...
Dataset used for paper "Issues-Driven Features for Software Fault Prediction". The dataset...
This is the replication package for our article "Problems with SZZ and Features: An empirical study ...
The accurate identification of defect-inducing commits representsa key problem for researchers inter...
Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 Int...
In order to develop and train defect prediction models, researchers rely on datasets in which a defe...
Reliably predicting software defects is one of the holy grails of software engineering. Researchers ...
During the last 10 years, hundreds of different defect prediction models have been published. The pe...
Abstract—The reliability of a prediction model depends on the quality of the data from which it was ...
Many software defect prediction models have been built using historical defect data obtained by mini...
In defect prediction studies, open-source and real-world defect data sets are frequently used. The q...
During the last 10 years, hundreds of different defect prediction models have been published. The p...
Defect prediction aims at identifying software artifacts that are likely to exhibit a defect. The ma...
Two recent studies explicitly recommend labeling defective classes in releases using the affected ve...
Defect prediction models can be beneficial to prioritize testing, analysis, or code review activitie...
Data from software repositories are a very useful asset to building dierent kinds of models and reco...
Dataset used for paper "Issues-Driven Features for Software Fault Prediction". The dataset...
This is the replication package for our article "Problems with SZZ and Features: An empirical study ...
The accurate identification of defect-inducing commits representsa key problem for researchers inter...
Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 Int...
In order to develop and train defect prediction models, researchers rely on datasets in which a defe...
Reliably predicting software defects is one of the holy grails of software engineering. Researchers ...
During the last 10 years, hundreds of different defect prediction models have been published. The pe...
Abstract—The reliability of a prediction model depends on the quality of the data from which it was ...
Many software defect prediction models have been built using historical defect data obtained by mini...
In defect prediction studies, open-source and real-world defect data sets are frequently used. The q...
During the last 10 years, hundreds of different defect prediction models have been published. The p...