This work presents a statistically principled method for estimating the required number of instances in the experimental comparison of multiple algorithms on a given problem class of interest. This approach generalises earlier results by allowing researchers to design experiments based on the desired best, worst, mean or median-case statistical power to detect differences between algorithms larger than a certain threshold. Holm’s step-down procedure is used to maintain the overall significance level controlled at desired levels, without resulting in overly conservative experiments. This paper also presents an approach for sampling each algorithm on each instance, based on optimal sample size ratios that minimise the total required number of...
Despite the continuous advancement of Evolutionary Algorithms (EAs) and their numerous successful ap...
A comparison of algorithms with respect to certain performance measures is of special interest in th...
In empirical studies of Evolutionary Algorithms, it is usually desirable to evaluate and compare alg...
This work presents a statistically principled method for estimating the required number of instances...
Experimental comparisons of performance represent an important aspect of research on optimization al...
Experimental algorithmics encompasses the study of guidelines and methods for computational evaluati...
Includes bibliographical references (p. 25-26).Ravindra K. Ahuja, James B. Orlin
Benchmark experiments nowadays are the method of choice to evaluate learn-ing algorithms in most res...
This paper proposes a statistical methodology for comparing the performance of evolutionary computat...
International audienceEmpirical performance evaluations, in competitions and scientific publications...
The performance of combinatorial algorithms is often evaluated by using the computational times of a...
This tutorial covers the basics of how to use statistical tests to evaluate and compare search-algo...
This article introduces alternative techniques to compare algorithmic performance. The first approac...
An experimental design is a formula or algorithm that specifies how resources are to be utilized thr...
Thousands of Machine Learning research papers contain experimental comparisons that usually have bee...
Despite the continuous advancement of Evolutionary Algorithms (EAs) and their numerous successful ap...
A comparison of algorithms with respect to certain performance measures is of special interest in th...
In empirical studies of Evolutionary Algorithms, it is usually desirable to evaluate and compare alg...
This work presents a statistically principled method for estimating the required number of instances...
Experimental comparisons of performance represent an important aspect of research on optimization al...
Experimental algorithmics encompasses the study of guidelines and methods for computational evaluati...
Includes bibliographical references (p. 25-26).Ravindra K. Ahuja, James B. Orlin
Benchmark experiments nowadays are the method of choice to evaluate learn-ing algorithms in most res...
This paper proposes a statistical methodology for comparing the performance of evolutionary computat...
International audienceEmpirical performance evaluations, in competitions and scientific publications...
The performance of combinatorial algorithms is often evaluated by using the computational times of a...
This tutorial covers the basics of how to use statistical tests to evaluate and compare search-algo...
This article introduces alternative techniques to compare algorithmic performance. The first approac...
An experimental design is a formula or algorithm that specifies how resources are to be utilized thr...
Thousands of Machine Learning research papers contain experimental comparisons that usually have bee...
Despite the continuous advancement of Evolutionary Algorithms (EAs) and their numerous successful ap...
A comparison of algorithms with respect to certain performance measures is of special interest in th...
In empirical studies of Evolutionary Algorithms, it is usually desirable to evaluate and compare alg...