doi:10.1214/lnms/1196285404Data mining is a process of discovering useful patterns (knowledge) hidden in extremely large datasets. Classification is a fundamental data mining function, and some other functions can be reduced to it. In this paper we propose a novel classification algorithm (classifier) called MIND (MINing in Databases). MIND can be phrased in such a way that its implementation is very easy using the extended relational calculus SQL, and this in turn allows the classifier to be built into a relational database system directly. MIND is truly scalable with respect to I/O efficiency, which is important since scalability is a key requirement for any data mining algorithm. We have built a prototype of MIND in the relational databa...
In this paper, we report our success in building efficient scalable classifiers by exploring the cap...
One fundamental limitation of classical statistical modeling is the assumption that data is represen...
Machine Learning is a research field with substantial relevance for many applications in different a...
doi:10.1214/lnms/1196285404Data mining is a process of discovering useful patterns (knowledge) hidde...
Software packages providing a whole set of data mining and machine learning algorithms are attractiv...
With the increasing demands of transforming raw data into information and knowledge, data mining be...
The fast increase in the size and number of databases demands data mining approaches that are scalab...
A new approach is needed to handle huge dataset stored in multiple tables in a very-large database. ...
Multi-relational data mining enables pattern mining from multiple tables. The existing multi-relatio...
In this paper, we propose a new approach for apply-ing data mining techniques, and more particularly...
A major obstacle to fully integrated deployment of many data mining algorithms is the assumption tha...
This paper presents an incremental and scalable learning algorithm in order to mine numeric, low dim...
With the wide availability of huge amounts of data and the imminent demands to transform the raw dat...
Mining for association rules between items in a large database of sales transactions has been descri...
The efficient mining of large, commercially credible, databases requires a solution to at least two ...
In this paper, we report our success in building efficient scalable classifiers by exploring the cap...
One fundamental limitation of classical statistical modeling is the assumption that data is represen...
Machine Learning is a research field with substantial relevance for many applications in different a...
doi:10.1214/lnms/1196285404Data mining is a process of discovering useful patterns (knowledge) hidde...
Software packages providing a whole set of data mining and machine learning algorithms are attractiv...
With the increasing demands of transforming raw data into information and knowledge, data mining be...
The fast increase in the size and number of databases demands data mining approaches that are scalab...
A new approach is needed to handle huge dataset stored in multiple tables in a very-large database. ...
Multi-relational data mining enables pattern mining from multiple tables. The existing multi-relatio...
In this paper, we propose a new approach for apply-ing data mining techniques, and more particularly...
A major obstacle to fully integrated deployment of many data mining algorithms is the assumption tha...
This paper presents an incremental and scalable learning algorithm in order to mine numeric, low dim...
With the wide availability of huge amounts of data and the imminent demands to transform the raw dat...
Mining for association rules between items in a large database of sales transactions has been descri...
The efficient mining of large, commercially credible, databases requires a solution to at least two ...
In this paper, we report our success in building efficient scalable classifiers by exploring the cap...
One fundamental limitation of classical statistical modeling is the assumption that data is represen...
Machine Learning is a research field with substantial relevance for many applications in different a...