With increasing availability and power of parallel computational resources, attention is drawn to the question of how best to apply those resources. Instead of simply finding the same answers more quickly, this thesis describes how parallel computational resources are used to explore disparate regions of a solution space by using diversity to steer the solution paths away from each other, thereby discouraging strictly greedy behavior. The formulation of models in a concept/solution space and its relationship to a search space are described as well as common search algorithms with heuristics for time or space computationally prohibitive searches. Measures of diversity are introduced, and the application of a beam search to the solution space...
Data mining over large data-sets is important due to its obvious commercial potential, However, it i...
Data mining is an emerging research area, whose goal is to discover potentially useful information e...
In this paper, we propose an algorithm to partition both the search space and the database for the p...
We live in the age of ever-increasing parallel computing resources, and as a consequence, for a deca...
This paper follows our earlier publication, where we introduced the idea of tuned data mining which ...
With the fast, continuous increase in the number and size of databases, parallel data mining is a na...
Abstract Recent years have shown the need of an automated process to discover interesting and hidden...
When using a greedy algorithm for finding a model, as is the case in many data mining algorithms, th...
Abstract. When computationally feasible, mining huge databases produces tremendously large numbers o...
Abstract. In this paper, we explore the two well-known principles of diversifica-tion and intensific...
The goal of data mining algorithm is to discover useful information embedded in large databases. Fre...
We present a deterministic parallel algorithm for the k-majority problem, that can be used to find i...
International audienceIn this paper, we explore the two well-known principles of diversification and...
Recently, several algorithms based on the MapReduce framework have been proposed for frequent patter...
Parallel Memetic Algorithms (PMAs) are a class of modern parallel meta-heuristics that combine evolu...
Data mining over large data-sets is important due to its obvious commercial potential, However, it i...
Data mining is an emerging research area, whose goal is to discover potentially useful information e...
In this paper, we propose an algorithm to partition both the search space and the database for the p...
We live in the age of ever-increasing parallel computing resources, and as a consequence, for a deca...
This paper follows our earlier publication, where we introduced the idea of tuned data mining which ...
With the fast, continuous increase in the number and size of databases, parallel data mining is a na...
Abstract Recent years have shown the need of an automated process to discover interesting and hidden...
When using a greedy algorithm for finding a model, as is the case in many data mining algorithms, th...
Abstract. When computationally feasible, mining huge databases produces tremendously large numbers o...
Abstract. In this paper, we explore the two well-known principles of diversifica-tion and intensific...
The goal of data mining algorithm is to discover useful information embedded in large databases. Fre...
We present a deterministic parallel algorithm for the k-majority problem, that can be used to find i...
International audienceIn this paper, we explore the two well-known principles of diversification and...
Recently, several algorithms based on the MapReduce framework have been proposed for frequent patter...
Parallel Memetic Algorithms (PMAs) are a class of modern parallel meta-heuristics that combine evolu...
Data mining over large data-sets is important due to its obvious commercial potential, However, it i...
Data mining is an emerging research area, whose goal is to discover potentially useful information e...
In this paper, we propose an algorithm to partition both the search space and the database for the p...