This thesis focuses on the algorithm selection problem, in which the goal is to recommend machine learning algorithms to a new dataset. The idea behind solving this issue is that algorithm performs similarly on similar datasets. The usual approach is to base the similarity measure on the fixed vector of metafeatures extracted out of each dataset. However, as the number of attributes among datasets varies, we may be loosing important information. Herein, we propose a family of algorithms able to handle even the non-propositional representations of datasets. Our methods use the idea of attribute assignment that builds the distance measure between datasets as a sum of distance given by the optimal assignment and an attribute distance measure. ...
International audienceSimilarity between objects plays an important role in both human cognitive pro...
The use of distance metrics such as the Euclidean or Manhattan distance for nearest neighbour algori...
Abstract: The problem of feature selection in data mining is an important real-world problem that in...
This thesis focuses on the algorithm selection problem, in which the goal is to recommend machine le...
In Data Mining, during the preprocessing step, there is a considerable diversity of candidate algori...
In Data Mining, during the preprocessing step, there is a considerable diversity of candidate algori...
Nowadays there is vast amount of data being collected and stored in databases and without automatic ...
This paper proposes and surveys genetic implementations of algorithms for selection and partitioning...
With the increase of available data, companies are looking for ways to use it to their advantages. O...
A typical machine learning algorithm takes advantage of training data to discover patterns among obs...
In optimization, algorithm selection, which is the selection of the most suitable algorithm for a sp...
Abstract. In this paper we address the problem of multiobjective attribute selection in data mining....
This paper discusses a genetic-algorithm-based approach for selecting a small number of representati...
The Algorithm Selection Problem is to select the most appropriate way for solving a problem given a ...
Clustering is a difficult and widely studied data mining task, with many varieties of clustering alg...
International audienceSimilarity between objects plays an important role in both human cognitive pro...
The use of distance metrics such as the Euclidean or Manhattan distance for nearest neighbour algori...
Abstract: The problem of feature selection in data mining is an important real-world problem that in...
This thesis focuses on the algorithm selection problem, in which the goal is to recommend machine le...
In Data Mining, during the preprocessing step, there is a considerable diversity of candidate algori...
In Data Mining, during the preprocessing step, there is a considerable diversity of candidate algori...
Nowadays there is vast amount of data being collected and stored in databases and without automatic ...
This paper proposes and surveys genetic implementations of algorithms for selection and partitioning...
With the increase of available data, companies are looking for ways to use it to their advantages. O...
A typical machine learning algorithm takes advantage of training data to discover patterns among obs...
In optimization, algorithm selection, which is the selection of the most suitable algorithm for a sp...
Abstract. In this paper we address the problem of multiobjective attribute selection in data mining....
This paper discusses a genetic-algorithm-based approach for selecting a small number of representati...
The Algorithm Selection Problem is to select the most appropriate way for solving a problem given a ...
Clustering is a difficult and widely studied data mining task, with many varieties of clustering alg...
International audienceSimilarity between objects plays an important role in both human cognitive pro...
The use of distance metrics such as the Euclidean or Manhattan distance for nearest neighbour algori...
Abstract: The problem of feature selection in data mining is an important real-world problem that in...