Software defect prediction research has adopted various evaluation measures to assess the performance of prediction models. In this paper, we further stress on the importance of the choice of appropriate measures in order to correctly assess strengths and weaknesses of a given defect prediction model, especially given that most of the defect prediction tasks suffer from data imbalance. Investigating 111 previous studies published between 2010 and 2020, we found out that over a half either use only one evaluation measure, which alone cannot express all the characteristics of model performance in presence of imbalanced data, or a set of binary measures which are prone to be biased when used to assess models especially when trained with imbal...
Background: The software industry spends a lot of money on finding and fixing defects. It utilises ...
AbstractSoftware defect prediction models are classifiers often built by setting a threshold t on a ...
Defect models that are trained on class imbalanced datasets (i.e., the proportion of defective and c...
Reliably predicting software defects is one of the holy grails of software engineering. Researchers ...
Context. Reports suggest that defects in code cost the US in excess of $50billion per year to put ri...
During the last 10 years, hundreds of different defect prediction models have been published. The pe...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
Software defect prediction performance varies over a large range. Menzies suggested there is a ceili...
Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 Int...
During the last 10 years, hundreds of different defect prediction models have been published. The p...
Abstract—Defect prediction models help software quality as-surance teams to effectively allocate the...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
Software defect prediction is motivated by the huge costs incurred as a result of software failures...
Software defect prediction strives to improve software quality and testing efficiency by constructin...
National Key Basic Research Program of China [2018YFB1004401]; the National Natural Science Foundati...
Background: The software industry spends a lot of money on finding and fixing defects. It utilises ...
AbstractSoftware defect prediction models are classifiers often built by setting a threshold t on a ...
Defect models that are trained on class imbalanced datasets (i.e., the proportion of defective and c...
Reliably predicting software defects is one of the holy grails of software engineering. Researchers ...
Context. Reports suggest that defects in code cost the US in excess of $50billion per year to put ri...
During the last 10 years, hundreds of different defect prediction models have been published. The pe...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
Software defect prediction performance varies over a large range. Menzies suggested there is a ceili...
Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 Int...
During the last 10 years, hundreds of different defect prediction models have been published. The p...
Abstract—Defect prediction models help software quality as-surance teams to effectively allocate the...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
Software defect prediction is motivated by the huge costs incurred as a result of software failures...
Software defect prediction strives to improve software quality and testing efficiency by constructin...
National Key Basic Research Program of China [2018YFB1004401]; the National Natural Science Foundati...
Background: The software industry spends a lot of money on finding and fixing defects. It utilises ...
AbstractSoftware defect prediction models are classifiers often built by setting a threshold t on a ...
Defect models that are trained on class imbalanced datasets (i.e., the proportion of defective and c...