The amount of data produced by ubiquitous computing applications is quickly growing, due to the pervasive presence of small devices endowed with sensing, computing and communication capabilities. Heterogeneity and strong interdependence, which characterize 'ubiquitous data', require a (multi-) relational approach to their analysis. However, relational data mining algorithms do not scale well and very large data sets are hardly processable. In this paper we propose an extension of a relational algorithm for multi-level frequent pattern discovery, which resorts to data sampling and distributed computation in Grid environments, in order to overcome the computational limits of the original serial algorithm. The set of patterns discovered by the...
In recent years, knowledge discovery in databases provides a powerful capability to discover meaning...
We propose a novel pattern tree called Pattern Count tree (PC- tree) which is a complete and compact...
The purpose of data mining from distributed information systems is usually threefold: (1) identifyin...
The amount of data produced by ubiquitous computing applications is quickly growing, due to the perv...
Recently, several algorithms based on the MapReduce framework have been proposed for frequent patter...
Industrial, scientific, and commercial applications use information systems to trace the execution o...
Industrial, scientific, and commercial applications use information systems to trace the execution o...
Data mining is an emerging research area, whose goal is to discover potentially useful information e...
Abstract Traditional methods for data mining typically make the assumption that data is centralized ...
Frequent pattern mining is an essential data mining task, with a goal of discovering knowledge in th...
Efficient mining of frequent patterns from large databases has been an active area of research since...
Traditional methods for frequent itemset mining typically assume that data is centralized and static...
International audienceDespite crucial recent advances, the problem of frequent itemset mining is sti...
In this paper we present DCI, a new data mining algorithm for frequent set counting. We also discuss...
Data Mining is one of the central activities associated with understanding and exploiting the world...
In recent years, knowledge discovery in databases provides a powerful capability to discover meaning...
We propose a novel pattern tree called Pattern Count tree (PC- tree) which is a complete and compact...
The purpose of data mining from distributed information systems is usually threefold: (1) identifyin...
The amount of data produced by ubiquitous computing applications is quickly growing, due to the perv...
Recently, several algorithms based on the MapReduce framework have been proposed for frequent patter...
Industrial, scientific, and commercial applications use information systems to trace the execution o...
Industrial, scientific, and commercial applications use information systems to trace the execution o...
Data mining is an emerging research area, whose goal is to discover potentially useful information e...
Abstract Traditional methods for data mining typically make the assumption that data is centralized ...
Frequent pattern mining is an essential data mining task, with a goal of discovering knowledge in th...
Efficient mining of frequent patterns from large databases has been an active area of research since...
Traditional methods for frequent itemset mining typically assume that data is centralized and static...
International audienceDespite crucial recent advances, the problem of frequent itemset mining is sti...
In this paper we present DCI, a new data mining algorithm for frequent set counting. We also discuss...
Data Mining is one of the central activities associated with understanding and exploiting the world...
In recent years, knowledge discovery in databases provides a powerful capability to discover meaning...
We propose a novel pattern tree called Pattern Count tree (PC- tree) which is a complete and compact...
The purpose of data mining from distributed information systems is usually threefold: (1) identifyin...