Imbalanced data is a common problem in data mining when dealing with classi cation problems, where samples of a class vastly outnumber other classes. In this situation, many data mining algorithms generate poor models as they try to opti- mize the overall accuracy and perform badly in classes with very few samples. Software Engineering data in general and defect prediction datasets are not an exception and in this paper, we compare different approaches, namely sampling, cost-sensitive, ensemble and hybrid approaches to the prob- lem of defect prediction with different datasets preprocessed differently. We have used the well-known NASA datasets curated by Shepperd et al. There are differences in the re- sults depending on the cha...
Machine learning techniques are frequent for the complicated task of predicting software defects. Of...
Context: Generally, there are more non-defective instances than defective instances in the datasets ...
Software defect prediction is a practical approach to improve the quality and efficiency of time and...
With the availability of high-speed Internet and the advent of Internet of Things devices, modern so...
Fault prediction problem has a crucial role in the software development process because it contribut...
Identifying defective software entities is essential to ensure software quality during software deve...
Software defect prediction performance varies over a large range. Menzies suggested there is a ceili...
In this short paper, we compare well-known rule/tree classifiers in software defect prediction with ...
The main problem in producing high accuracy software defect prediction is if the data set has an imb...
Predicting when and where bugs will appear in software may assist improve quality and save on softwa...
Background: The software industry spends a lot of money on finding and fixing defects. It utilises ...
Software defect prediction research has adopted various evaluation measures to assess the performanc...
Background: Ensemble techniques have gained attention in various scientific fields. Defect predictio...
The software has turn into an imperious part of human’s life. In the recent computing era, many larg...
The most important part in software engineering is a software defect prediction. Software defect pre...
Machine learning techniques are frequent for the complicated task of predicting software defects. Of...
Context: Generally, there are more non-defective instances than defective instances in the datasets ...
Software defect prediction is a practical approach to improve the quality and efficiency of time and...
With the availability of high-speed Internet and the advent of Internet of Things devices, modern so...
Fault prediction problem has a crucial role in the software development process because it contribut...
Identifying defective software entities is essential to ensure software quality during software deve...
Software defect prediction performance varies over a large range. Menzies suggested there is a ceili...
In this short paper, we compare well-known rule/tree classifiers in software defect prediction with ...
The main problem in producing high accuracy software defect prediction is if the data set has an imb...
Predicting when and where bugs will appear in software may assist improve quality and save on softwa...
Background: The software industry spends a lot of money on finding and fixing defects. It utilises ...
Software defect prediction research has adopted various evaluation measures to assess the performanc...
Background: Ensemble techniques have gained attention in various scientific fields. Defect predictio...
The software has turn into an imperious part of human’s life. In the recent computing era, many larg...
The most important part in software engineering is a software defect prediction. Software defect pre...
Machine learning techniques are frequent for the complicated task of predicting software defects. Of...
Context: Generally, there are more non-defective instances than defective instances in the datasets ...
Software defect prediction is a practical approach to improve the quality and efficiency of time and...