When using a greedy algorithm for finding a model, as is the case in many data mining algorithms, there is a risk of getting caught in local extrema, i.e., suboptimal solutions. Widening is a technique for enhancing greedy algorithms by using parallel resources to broaden the search in the model space. The most important component of widening is the selector, a function that chooses the next models to refine. This selector ideally enforces diversity within the selected set of models in order to ensure that parallel workers explore sufficiently different parts of the model space and do not end up mimicking a simple beam search. Previous publications have shown that this works well for problems with a suitable distance measure for the models,...
Niching methods extend genetic algorithms to domains that require the location and maintenance of mu...
Feature selection is a fundamental problem in machine learning and data mining. The majority of feat...
We consider the problem of variable or feature selection for model-based clustering. The problem of ...
When using a greedy algorithm for finding a model, as is the case in many data mining algorithms, th...
This paper follows our earlier publication, where we introduced the idea of tuned data mining which ...
We live in the age of ever-increasing parallel computing resources, and as a consequence, for a deca...
We present a selection algorithm called BucketSelect which runs faster than Floyd-Rivest\u27s Select...
Greedy algorithms (also called “Hill Climbing”) are algorithms that are iterative in nature and choo...
This paper introduces a greedy method of performing k-fold cross validation and shows how the propos...
Given a set of models and some training data, we would like to find the model which best describes t...
With increasing availability and power of parallel computational resources, attention is drawn to th...
The Algorithm Selection Problem is to select the most appropriate way for solving a problem given a ...
Subset selection is fundamental in combinatorial optimization with applications in biology, operatio...
Selecting a good model of a set of input points by cross validation is a computationally intensive p...
Suppose there is a large collection of items, each with an as-sociated cost and an inherent utility ...
Niching methods extend genetic algorithms to domains that require the location and maintenance of mu...
Feature selection is a fundamental problem in machine learning and data mining. The majority of feat...
We consider the problem of variable or feature selection for model-based clustering. The problem of ...
When using a greedy algorithm for finding a model, as is the case in many data mining algorithms, th...
This paper follows our earlier publication, where we introduced the idea of tuned data mining which ...
We live in the age of ever-increasing parallel computing resources, and as a consequence, for a deca...
We present a selection algorithm called BucketSelect which runs faster than Floyd-Rivest\u27s Select...
Greedy algorithms (also called “Hill Climbing”) are algorithms that are iterative in nature and choo...
This paper introduces a greedy method of performing k-fold cross validation and shows how the propos...
Given a set of models and some training data, we would like to find the model which best describes t...
With increasing availability and power of parallel computational resources, attention is drawn to th...
The Algorithm Selection Problem is to select the most appropriate way for solving a problem given a ...
Subset selection is fundamental in combinatorial optimization with applications in biology, operatio...
Selecting a good model of a set of input points by cross validation is a computationally intensive p...
Suppose there is a large collection of items, each with an as-sociated cost and an inherent utility ...
Niching methods extend genetic algorithms to domains that require the location and maintenance of mu...
Feature selection is a fundamental problem in machine learning and data mining. The majority of feat...
We consider the problem of variable or feature selection for model-based clustering. The problem of ...