This tutorial covers the basics of how to use statistical tests to evaluate and compare search-algorithms, in particular when applied on software engineering problems. Search-algorithms like Hill Climbing and Genetic Algorithms are randomised. Running such randomised algorithms twice on the same problem can give different results. It is hence important to run such algorithms multiple times to collect average results, and avoid so publishing wrong conclusions that were based on just luck. However, there is the question of how often such runs should be repeated. Given a set of n repeated experiments, is such n large enough to draw sound conclusions? Or should had more experiments been run? Statistical tests like the Wilcoxon-Mann-W...
Three recent random search algorithms are compared on the basis of efficiency, and on the basis of s...
Search-based statistical structural testing (SBSST) is a promising technique that uses automated sea...
Success rate is a commonly adopted performance criterion for evaluating Evolutionary Algorithms due ...
This tutorial covers the basics of how to use statistical tests to evaluate and compare search-algor...
Software testing is an expensive process, which is vital in the industry. Construction of the test-d...
Abstract—Statistical testing generates test inputs by sampling from a probability distribution that ...
peer reviewedEvolutionary algorithms have been shown to be effective at generating unit test suites...
Abstract. Software testing is an expensive process, which is vital in the industry. Construction of ...
peer reviewedRandomized algorithms are widely used to address many types of software engineering pro...
The use of random search is very poor at finding solutions when those solutions occupy a very small ...
Heuristic algorithms are often difficult to analyse theoretically; this holds in particular for adva...
Mutation testing is widely used in experiments. Some papers experiment with mutation directly, while...
AbstractTest functions are commonly used to evaluate the effectiveness of different search algorithm...
This work presents a statistically principled method for estimating the required number of instances...
Testing a software artefact using every one of its possible inputs would normally cost too much, and...
Three recent random search algorithms are compared on the basis of efficiency, and on the basis of s...
Search-based statistical structural testing (SBSST) is a promising technique that uses automated sea...
Success rate is a commonly adopted performance criterion for evaluating Evolutionary Algorithms due ...
This tutorial covers the basics of how to use statistical tests to evaluate and compare search-algor...
Software testing is an expensive process, which is vital in the industry. Construction of the test-d...
Abstract—Statistical testing generates test inputs by sampling from a probability distribution that ...
peer reviewedEvolutionary algorithms have been shown to be effective at generating unit test suites...
Abstract. Software testing is an expensive process, which is vital in the industry. Construction of ...
peer reviewedRandomized algorithms are widely used to address many types of software engineering pro...
The use of random search is very poor at finding solutions when those solutions occupy a very small ...
Heuristic algorithms are often difficult to analyse theoretically; this holds in particular for adva...
Mutation testing is widely used in experiments. Some papers experiment with mutation directly, while...
AbstractTest functions are commonly used to evaluate the effectiveness of different search algorithm...
This work presents a statistically principled method for estimating the required number of instances...
Testing a software artefact using every one of its possible inputs would normally cost too much, and...
Three recent random search algorithms are compared on the basis of efficiency, and on the basis of s...
Search-based statistical structural testing (SBSST) is a promising technique that uses automated sea...
Success rate is a commonly adopted performance criterion for evaluating Evolutionary Algorithms due ...