In this paper, we propose a new approach for apply-ing data mining techniques, and more particularly super-vised machine learning algorithms, to large databases, in acceptable response times. This goal is achieved by inte-grating these algorithms within a Database Management System.Weare thus only limited by disk capacity, andnot by available main memory. However, the disk accesses that are necessary to scan the database induce long re-sponse times. Hence, we propose an original method to reduce the size of the learning set by building its contin-gency table. The machine learning algorithms are then adapted to operate on this contingency table. In order to validate our approach, we implemented the ID3 decision tree construction method and s...
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stor...
One of the main obstacles in applying data mining techniques to large, real-world databases is the l...
Modern techniques of capturing data have thrown, besides storage, another couple of challenges to th...
We propose in this paper a new approach for applying data mining al-gorithms, and more particularly ...
Abstract Recent years have shown the need of an automated process to discover interesting and hidden...
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stor...
The efficient mining of large, commercially credible, databases requires a solution to at least two ...
With the increasing demands of transforming raw data into information and knowledge, data mining be...
In a world with an ever-increasing amount of data processed, providing tools for highquality and fas...
International audienceWith the emergence of machine learning (ML) techniques in database research, M...
With the wide availability of huge amounts of data and the imminent demands to transform the raw dat...
Abstract: In the context of data mining the feature size is very large and it is believed that it ne...
In this paper we propose a scaling-up method that is applicable to essentially any induction algorit...
We present a general approach to speeding up a family of multi-relational data mining algorithms t...
doi:10.1214/lnms/1196285404Data mining is a process of discovering useful patterns (knowledge) hidde...
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stor...
One of the main obstacles in applying data mining techniques to large, real-world databases is the l...
Modern techniques of capturing data have thrown, besides storage, another couple of challenges to th...
We propose in this paper a new approach for applying data mining al-gorithms, and more particularly ...
Abstract Recent years have shown the need of an automated process to discover interesting and hidden...
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stor...
The efficient mining of large, commercially credible, databases requires a solution to at least two ...
With the increasing demands of transforming raw data into information and knowledge, data mining be...
In a world with an ever-increasing amount of data processed, providing tools for highquality and fas...
International audienceWith the emergence of machine learning (ML) techniques in database research, M...
With the wide availability of huge amounts of data and the imminent demands to transform the raw dat...
Abstract: In the context of data mining the feature size is very large and it is believed that it ne...
In this paper we propose a scaling-up method that is applicable to essentially any induction algorit...
We present a general approach to speeding up a family of multi-relational data mining algorithms t...
doi:10.1214/lnms/1196285404Data mining is a process of discovering useful patterns (knowledge) hidde...
Efficiency is crucial in KDD (Knowledge Discovery in Databases), due to the huge amount of data stor...
One of the main obstacles in applying data mining techniques to large, real-world databases is the l...
Modern techniques of capturing data have thrown, besides storage, another couple of challenges to th...