We present an algorithm designed to efficiently construct a decision tree over heterogeneously distributed data without centralizing. We compare our algorithm against a standard centralized decision tree implementation in terms of accuracy as well as the communication complexity. Our experimental results show that by using only 20 % of the communication cost necessary to centralize the data we can achieve trees with accuracy at least 80 % of the trees produced by the centralized version
Abstract. In the fields of data mining and machine learning the amount of data available for buildin...
With the advantages of being easy to understand and efficient to compute, the decision tree method h...
This paper presents a study that discusses how multi-threading can be used to improve the runtime pe...
This paper motivates and precisely formulates the problem of learning from distributed data; descri...
Abstract Decision tree (and its extensions such as Gradient Boosting Decision Trees and Random Fores...
A communication theory approach to decision tree design based on a top-town mutual information algor...
Abstract—Classification based on decision trees is one of the important problems in data mining and ...
Abstract: The construction of efficient decision and classification trees is a fundamental task in B...
Abstract. In most of data mining systems decision trees are induced in a top-down manner. This greed...
Data mining is nontrivial extraction of implicit, previously unknown and potential useful informatio...
This thesis investigates the problem of growing decision trees from data, for the purposes of classi...
This paper treats the problem of construction of efficient decision trees. Construction of optimal d...
Optimal decision trees are not easily improvable in terms of accuracy. However, improving the pre-pr...
Abstract. Selecting the close-to-optimal collective algorithm based on the parameters of the collect...
This paper treats the problem of conversion of decision tables to decision trees. In most cases, the...
Abstract. In the fields of data mining and machine learning the amount of data available for buildin...
With the advantages of being easy to understand and efficient to compute, the decision tree method h...
This paper presents a study that discusses how multi-threading can be used to improve the runtime pe...
This paper motivates and precisely formulates the problem of learning from distributed data; descri...
Abstract Decision tree (and its extensions such as Gradient Boosting Decision Trees and Random Fores...
A communication theory approach to decision tree design based on a top-town mutual information algor...
Abstract—Classification based on decision trees is one of the important problems in data mining and ...
Abstract: The construction of efficient decision and classification trees is a fundamental task in B...
Abstract. In most of data mining systems decision trees are induced in a top-down manner. This greed...
Data mining is nontrivial extraction of implicit, previously unknown and potential useful informatio...
This thesis investigates the problem of growing decision trees from data, for the purposes of classi...
This paper treats the problem of construction of efficient decision trees. Construction of optimal d...
Optimal decision trees are not easily improvable in terms of accuracy. However, improving the pre-pr...
Abstract. Selecting the close-to-optimal collective algorithm based on the parameters of the collect...
This paper treats the problem of conversion of decision tables to decision trees. In most cases, the...
Abstract. In the fields of data mining and machine learning the amount of data available for buildin...
With the advantages of being easy to understand and efficient to compute, the decision tree method h...
This paper presents a study that discusses how multi-threading can be used to improve the runtime pe...