Software defect prediction performance varies over a large range. Menzies suggested there is a ceiling effect of 80% Recall [8]. Most of the data sets used are highly imbalanced. This paper asks, what is the empirical effect of using different datasets with varying levels of imbalance on predictive performance? We use data synthesised by a previous meta-analysis of 600 fault prediction models and their results. Four model evaluation measures (the Mathews Correlation Coeficient (MCC), F-Measure, Precision and Re- call ) are compared to the corresponding data imbalance ratio. When the data are imbalanced, the predictive performance of software defect prediction studies is low. As the data become more balanced, the predictive performance of pr...
The ongoing development of computer systems requires massive software projects. Running the componen...
Context: Software defect prediction plays a crucial role in estimating the most defect-prone compone...
The performance of software defect prediction(SDP) models is known to be dependent on the datasets u...
Defect models that are trained on class imbalanced datasets (i.e., the proportion of defective and c...
Software defect prediction research has adopted various evaluation measures to assess the performanc...
Imbalanced data is a common problem in data mining when dealing with classi cation problems, where ...
Abstract—Defect prediction models help software quality as-surance teams to effectively allocate the...
With the availability of high-speed Internet and the advent of Internet of Things devices, modern so...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
In evaluating the performance of software defect prediction models, accuracy measures such as precis...
Cross-project defect prediction (CPDP), where data from different software projects are used to pred...
ISSREW 2016 : 2016 IEEE International Symposium on Software Reliability Engineering Workshops, 23-27...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
Software defect prediction strives to improve software quality and testing efficiency by constructin...
During the last 10 years, hundreds of different defect prediction models have been published. The pe...
The ongoing development of computer systems requires massive software projects. Running the componen...
Context: Software defect prediction plays a crucial role in estimating the most defect-prone compone...
The performance of software defect prediction(SDP) models is known to be dependent on the datasets u...
Defect models that are trained on class imbalanced datasets (i.e., the proportion of defective and c...
Software defect prediction research has adopted various evaluation measures to assess the performanc...
Imbalanced data is a common problem in data mining when dealing with classi cation problems, where ...
Abstract—Defect prediction models help software quality as-surance teams to effectively allocate the...
With the availability of high-speed Internet and the advent of Internet of Things devices, modern so...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
In evaluating the performance of software defect prediction models, accuracy measures such as precis...
Cross-project defect prediction (CPDP), where data from different software projects are used to pred...
ISSREW 2016 : 2016 IEEE International Symposium on Software Reliability Engineering Workshops, 23-27...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
Software defect prediction strives to improve software quality and testing efficiency by constructin...
During the last 10 years, hundreds of different defect prediction models have been published. The pe...
The ongoing development of computer systems requires massive software projects. Running the componen...
Context: Software defect prediction plays a crucial role in estimating the most defect-prone compone...
The performance of software defect prediction(SDP) models is known to be dependent on the datasets u...