Univariate decision tree algorithms are widely used in Data Mining because (i) they are easy to learn (ii) when trained they can be expressed in rule based manner. In several applications mainly including Data Mining, the dataset to be learned is very large. In those cases it is highly desirable to construct univariate decision trees in reasonable time. This may be accomplished by parallelizing univariate decision tree algorithms. In this paper, we first present two different univariate decision tree algorithms C4.5 and univariate Linear Discriminant Tree. We show how to parallelize these algorithms in three ways: (i) feature based, (ii) node based (iii) data based manners. Experimental results show that performance of the parallelizations ...
Learning decision trees against very large amounts of data is not practical on single node computer...
Classification is an important data mining problem. Although classification is a wellstudied problem...
In this paper, we present ScalParC (Scalable Parallel Classifier), a new parallel formulation of a d...
Abstract. In the fields of data mining and machine learning the amount of data available for buildin...
Abstract Decision tree (and its extensions such as Gradient Boosting Decision Trees and Random Fores...
This paper presents a study that discusses how multi-threading can be used to improve the runtime pe...
Abstract. Decision trees are one of the most effective and widely used induction methods that have r...
One of the important problems in data mining is discov-ering classification models from datasets. Ap...
Data mining is the process of discovering interesting and useful patterns and relationships in large...
One of the important problems in data mining [SAD + 93] is the classification-rule learning. The c...
Abstract—Decision tree construction is a well-studied data mining problem. In this paper, we focus o...
Data mining refers to the process of finding hidden patterns inside a large dataset. While improving...
We discuss and test empirically the effects of six dimensions along which existing decision tree ind...
Abstract. In most of data mining systems decision trees are induced in a top-down manner. This greed...
In this paper, we present ScalParC (Scalable Parallel Classifier), a new parallel formulation of a d...
Learning decision trees against very large amounts of data is not practical on single node computer...
Classification is an important data mining problem. Although classification is a wellstudied problem...
In this paper, we present ScalParC (Scalable Parallel Classifier), a new parallel formulation of a d...
Abstract. In the fields of data mining and machine learning the amount of data available for buildin...
Abstract Decision tree (and its extensions such as Gradient Boosting Decision Trees and Random Fores...
This paper presents a study that discusses how multi-threading can be used to improve the runtime pe...
Abstract. Decision trees are one of the most effective and widely used induction methods that have r...
One of the important problems in data mining is discov-ering classification models from datasets. Ap...
Data mining is the process of discovering interesting and useful patterns and relationships in large...
One of the important problems in data mining [SAD + 93] is the classification-rule learning. The c...
Abstract—Decision tree construction is a well-studied data mining problem. In this paper, we focus o...
Data mining refers to the process of finding hidden patterns inside a large dataset. While improving...
We discuss and test empirically the effects of six dimensions along which existing decision tree ind...
Abstract. In most of data mining systems decision trees are induced in a top-down manner. This greed...
In this paper, we present ScalParC (Scalable Parallel Classifier), a new parallel formulation of a d...
Learning decision trees against very large amounts of data is not practical on single node computer...
Classification is an important data mining problem. Although classification is a wellstudied problem...
In this paper, we present ScalParC (Scalable Parallel Classifier), a new parallel formulation of a d...