Associated research group: Critical Systems Research GroupContext: There are many methods that input static code features and output a predictor for faulty code modules. These data mining methods have hit a "performance ceiling"; i.e., some inherent upper bound on the amount of information offered by, say, static code features when identifying modules which contain faults. Objective: We seek an explanation for this ceiling effect. Perhaps static code features have "limited information content"; i.e. their information can be quickly and completely discovered by even simple learners. Method:An initial literature review documents the ceiling effect in other work. Next, using three sub-sampling techniques (under-, over-, and micro-sampling), we...
Context. Reports suggest that defects in code cost the US in excess of $50billion per year to put ri...
During the last 10 years, hundreds of different defect prediction models have been published. The pe...
Context: Identifying defects in code early is important. A wide range of static code metrics have be...
While data miners can learn defect predictors from static code features, the performance improve-men...
The goal of the PROMISE workshop is to create a set of re-producible software engineering experiment...
In defect prediction studies, open-source and real-world defect data sets are frequently used. The q...
Background: Software defect prediction has been an active area of research for the last few decades....
Software defect prediction performance varies over a large range. Menzies suggested there is a ceili...
Many studies have been carried out to predict the presence of software code defects using static cod...
Bug prediction is aimed at identifying software artifacts that are more likely to be defective in th...
Context: Identifying defects in code early is important. A wide range of static code metrics have be...
Context: The adequacy of fault-proneness prediction models in representing the relationship between ...
Abstract—Short abstract needed, please. Index Terms—Defect prediction, accuracy measures, static cod...
Defect models that are trained on class imbalanced datasets (i.e., the proportion of defective and c...
Directly learning a defect prediction model from cross-project datasets results in a model with poor...
Context. Reports suggest that defects in code cost the US in excess of $50billion per year to put ri...
During the last 10 years, hundreds of different defect prediction models have been published. The pe...
Context: Identifying defects in code early is important. A wide range of static code metrics have be...
While data miners can learn defect predictors from static code features, the performance improve-men...
The goal of the PROMISE workshop is to create a set of re-producible software engineering experiment...
In defect prediction studies, open-source and real-world defect data sets are frequently used. The q...
Background: Software defect prediction has been an active area of research for the last few decades....
Software defect prediction performance varies over a large range. Menzies suggested there is a ceili...
Many studies have been carried out to predict the presence of software code defects using static cod...
Bug prediction is aimed at identifying software artifacts that are more likely to be defective in th...
Context: Identifying defects in code early is important. A wide range of static code metrics have be...
Context: The adequacy of fault-proneness prediction models in representing the relationship between ...
Abstract—Short abstract needed, please. Index Terms—Defect prediction, accuracy measures, static cod...
Defect models that are trained on class imbalanced datasets (i.e., the proportion of defective and c...
Directly learning a defect prediction model from cross-project datasets results in a model with poor...
Context. Reports suggest that defects in code cost the US in excess of $50billion per year to put ri...
During the last 10 years, hundreds of different defect prediction models have been published. The pe...
Context: Identifying defects in code early is important. A wide range of static code metrics have be...