Abstract. The goal of this paper is to reduce the classification (inference) complexity of tree ensembles by choosing a single representative model out of an ensemble of multiple decision-tree models. We compute the similarity between the different models in the ensemble and choose the model that is most similar to the others as the best representative of the entire dataset. The similarity-based approach is implemented with three different similarity metrics: a syntactic one, a semantic one, and a linear combination of the two. We compare this tree-selection methodology to a popular ensemble algorithm (majority voting) and to the baseline of randomly choosing one of the local models. In addition, we evaluate two alternative tree-selection strategies: choosing ...
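To make the selection idea concrete, the following is a minimal sketch in Python, assuming a purely semantic similarity defined as the fraction of held-out samples on which two trees agree; the metric definition, the validation split, and all function and variable names are illustrative assumptions rather than the paper's exact formulation. The representative tree is the medoid under this similarity, i.e., the tree whose average agreement with the other trees is highest.

# Sketch only: semantic similarity = prediction agreement on a held-out sample
# (an assumption for illustration, not the paper's exact metric).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
forest = RandomForestClassifier(n_estimators=25, random_state=0).fit(X, y)
trees = forest.estimators_

# Pairwise semantic similarity: fraction of validation points on which two trees agree.
X_val = X[:500]
preds = np.array([t.predict(X_val) for t in trees])            # shape (n_trees, n_val)
agreement = (preds[:, None, :] == preds[None, :, :]).mean(-1)  # shape (n_trees, n_trees)

# The representative tree is the one most similar, on average, to all the others.
np.fill_diagonal(agreement, 0.0)
representative_idx = agreement.mean(axis=1).argmax()
representative_tree = trees[representative_idx]
print("chosen tree index:", representative_idx)

A syntactic metric (e.g., one comparing split structure) or a weighted combination of the two could be substituted by replacing the agreement matrix, which is where the three metric variants mentioned in the abstract would differ.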