Abstract—Classification based on decision trees is one of the important problems in data mining and has applications in many fields. In recent years, database systems have become highly distributed, and distributed system paradigms, such as federated and peer-topeer databases, are being adopted. In this paper, we consider the problem of inducing decision trees in a large distributed network of genomic databases. Our work is motivated by the existence of distributed databases in healthcare and in bioinformatics, and by the emergence of systems which automatically analyze these databases, and by the expectancy that these databases will soon contain large amounts of highly dimensional genomic data. Current decision tree algorithms require high...
Classification of very large datasets is a challenging problem in data mining. It is desirable to h...
Several algorithms have been proposed in the literature for building decision trees (DT) for large d...
This study reports the empirical analysis of a hyper-heuristic evolutionary algorithm that is capabl...
Data mining is nontrivial extraction of implicit, previously unknown and potential useful informatio...
Abstract. In most of data mining systems decision trees are induced in a top-down manner. This greed...
This paper motivates and precisely formulates the problem of learning from distributed data; descri...
The study proposes a decision tree based classification of gene expression and protein display data....
Hierarchical multilabel classification (HMC) is a variant of classification where instances may belo...
We present an algorithm designed to efficiently construct a decision tree over heterogeneously distr...
Hierarchical multilabel classification (HMC) is a variant of classification where instances may belo...
Most algorithms for learning and pattern discovery in data assume that all the needed data is availa...
A machine learning technique called Graph-Based Induction (GBI) efficiently extracts typical pattern...
In this chapter, we introduce the reader to a popular family of machine learning algorithms, called ...
DNA microarrays (gene chips), frequently used in biological and medical studies, measure the express...
Data mining approaches have been increasingly used in recent years in order to find patterns and reg...
Classification of very large datasets is a challenging problem in data mining. It is desirable to h...
Several algorithms have been proposed in the literature for building decision trees (DT) for large d...
This study reports the empirical analysis of a hyper-heuristic evolutionary algorithm that is capabl...
Data mining is nontrivial extraction of implicit, previously unknown and potential useful informatio...
Abstract. In most of data mining systems decision trees are induced in a top-down manner. This greed...
This paper motivates and precisely formulates the problem of learning from distributed data; descri...
The study proposes a decision tree based classification of gene expression and protein display data....
Hierarchical multilabel classification (HMC) is a variant of classification where instances may belo...
We present an algorithm designed to efficiently construct a decision tree over heterogeneously distr...
Hierarchical multilabel classification (HMC) is a variant of classification where instances may belo...
Most algorithms for learning and pattern discovery in data assume that all the needed data is availa...
A machine learning technique called Graph-Based Induction (GBI) efficiently extracts typical pattern...
In this chapter, we introduce the reader to a popular family of machine learning algorithms, called ...
DNA microarrays (gene chips), frequently used in biological and medical studies, measure the express...
Data mining approaches have been increasingly used in recent years in order to find patterns and reg...
Classification of very large datasets is a challenging problem in data mining. It is desirable to h...
Several algorithms have been proposed in the literature for building decision trees (DT) for large d...
This study reports the empirical analysis of a hyper-heuristic evolutionary algorithm that is capabl...