Context: Conducting experiments is central to research machine learning research to benchmark, evaluate and compare learning algorithms. Consequently it is important we conduct reliable, trustworthy experiments. Objective: We investigate the incidence of errors in a sample of machine learning experiments in the domain of software defect prediction. Our focus is simple arithmetical and statistical errors. Method: We analyse 49 papers describing 2456 individual experimental results from a previously undertaken systematic review comparing supervised and unsupervised defect prediction classifiers. We extract the confusion matrices and test for relevant constraints, e.g., the marginal probabilities must sum to one. We also check for multiple sta...
National Key Basic Research Program of China [2018YFB1004401]; the National Natural Science Foundati...
Machine learning (ML) inherently suffers from at least a small amount of inaccuracy. Typically, thes...
In 1988, Langley wrote an influential editorial in the journal Machine Learning titled \u201cMachine...
This data set contains the results of an extensive, systematic literature review on the use of machi...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
Context The trustworthiness of research results is a growing concern in many empirical disciplines. ...
IEEE In 2014 we published a meta-analysis of software defect prediction studies [1]. This suggested ...
In the last years, Machine Learning (ML) has become extremely used in software systems: it is applie...
Much research on Machine Learning testing relies on empirical studies that evaluate and show their p...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
This dataset is about a systematic review of unsupervised learning techniques for software defect pr...
© 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Background: There has been much discussion amongst automated software defect prediction researchers ...
Background Examples of questionable statistical practice, when published in high quality software e...
OBJECTIVE: To analyse data from a trial and report the frequencies with which major and minor errors...
National Key Basic Research Program of China [2018YFB1004401]; the National Natural Science Foundati...
Machine learning (ML) inherently suffers from at least a small amount of inaccuracy. Typically, thes...
In 1988, Langley wrote an influential editorial in the journal Machine Learning titled \u201cMachine...
This data set contains the results of an extensive, systematic literature review on the use of machi...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
Context The trustworthiness of research results is a growing concern in many empirical disciplines. ...
IEEE In 2014 we published a meta-analysis of software defect prediction studies [1]. This suggested ...
In the last years, Machine Learning (ML) has become extremely used in software systems: it is applie...
Much research on Machine Learning testing relies on empirical studies that evaluate and show their p...
Background. The ability to predict defect-prone software components would be valuable. Consequently,...
This dataset is about a systematic review of unsupervised learning techniques for software defect pr...
© 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Background: There has been much discussion amongst automated software defect prediction researchers ...
Background Examples of questionable statistical practice, when published in high quality software e...
OBJECTIVE: To analyse data from a trial and report the frequencies with which major and minor errors...
National Key Basic Research Program of China [2018YFB1004401]; the National Natural Science Foundati...
Machine learning (ML) inherently suffers from at least a small amount of inaccuracy. Typically, thes...
In 1988, Langley wrote an influential editorial in the journal Machine Learning titled \u201cMachine...