An Extensive Analysis of Efficient Bug Prediction Configurations

Osman, Haidar
Ghafari, Mohammad
Nierstrasz, Oscar Marius
Lungu, Mircea

Publication date

January 2017

DOI

10.1145/3127005.3127017

Publisher

Association for Computing Machinery (ACM)

Abstract

Background: Bug prediction helps developers steer maintenance activities towards the buggy parts of a software system. A bug predictor has many design aspects, namely the software metrics, the machine learning model, and the response variable, each of which has several options. Aims: These design decisions should be made judiciously, because an improper choice in any of them might lead to wrong, misleading, or even useless results. We argue that bug prediction configurations are intertwined and thus need to be evaluated in their entirety, in contrast to the common practice in the field where each aspect is investigated in isolation. Method: We use a cost-aware evaluation scheme to evaluate 60 different bug prediction configuration combinations on five o...