This paper follows our earlier publication, where we introduced the idea of tuned data mining which draws on parallel resources to improve model accuracy rather than the usual focus on speed-up. In this paper we present a more in-depth analysis of the concept of Widened Data Mining, which aims at reducing the impact of greedy heuristics by exploring more than just one suitable solution at each step. In particular we focus on how diversity considerations can substantially improve results. We again use the greedy algorithm for the set cover problem to demonstrate these effects in practice
The most successful multi-objective metaheuristics, such as NSGA II and SPEA 2, usually apply a form...
© 2015 ACM. We say that an object o attracts a user u if o is one of the top-k objects according to ...
Understanding the behaviour of heuristic search methods is a challenge. This even holds for simple l...
When using a greedy algorithm for finding a model, as is the case in many data mining algorithms, th...
With increasing availability and power of parallel computational resources, attention is drawn to th...
We live in the age of ever-increasing parallel computing resources, and as a consequence, for a deca...
Greedy algorithms (also called “Hill Climbing”) are algorithms that are iterative in nature and choo...
In this paper we show that diversity-driven widening, the parallel exploration of the model space wi...
Most of the research in parallel data mining and machine learning algorithms is focused on improving...
International audienceIn recent years, pattern mining has moved from a slow-moving repeated three-st...
Recently, result diversification has attracted a lot of atten-tion as a means to improve the quality...
Maintaining diversity is important for the performance of evolutionary algorithms. Diversity-preserv...
Genetic Algorithms provide a weak search method that scales rather badly when used in their traditio...
Constraint Programming is becoming competitive for solving certain data-mining problems largely due ...
Diversity maximization aims to select a diverse and representative subset of items from a large data...
The most successful multi-objective metaheuristics, such as NSGA II and SPEA 2, usually apply a form...
© 2015 ACM. We say that an object o attracts a user u if o is one of the top-k objects according to ...
Understanding the behaviour of heuristic search methods is a challenge. This even holds for simple l...
When using a greedy algorithm for finding a model, as is the case in many data mining algorithms, th...
With increasing availability and power of parallel computational resources, attention is drawn to th...
We live in the age of ever-increasing parallel computing resources, and as a consequence, for a deca...
Greedy algorithms (also called “Hill Climbing”) are algorithms that are iterative in nature and choo...
In this paper we show that diversity-driven widening, the parallel exploration of the model space wi...
Most of the research in parallel data mining and machine learning algorithms is focused on improving...
International audienceIn recent years, pattern mining has moved from a slow-moving repeated three-st...
Recently, result diversification has attracted a lot of atten-tion as a means to improve the quality...
Maintaining diversity is important for the performance of evolutionary algorithms. Diversity-preserv...
Genetic Algorithms provide a weak search method that scales rather badly when used in their traditio...
Constraint Programming is becoming competitive for solving certain data-mining problems largely due ...
Diversity maximization aims to select a diverse and representative subset of items from a large data...
The most successful multi-objective metaheuristics, such as NSGA II and SPEA 2, usually apply a form...
© 2015 ACM. We say that an object o attracts a user u if o is one of the top-k objects according to ...
Understanding the behaviour of heuristic search methods is a challenge. This even holds for simple l...