Exploiting Parallelism in Decision Tree Induction. In Parallel and Distributed computing for Machine Learning

Nuno Amado
O Silva

Publication date

January 2003

Abstract

Abstract. In the fields of data mining and machine learning the amount of data available for building classifiers is growing very fast. Parallelism may be a good solution to reduce the amount of time spent in building classifiers from very-large datasets while keeping the classification accu-racy. This work first overviews some strategies for implementing decision tree construction algorithms in parallel based on techniques such as task parallelism, data parallelism and hybrid parallelism. We then describe a new parallel implementation of the C4.5 decision tree construction algo-rithm using a breadth-first strategy, data and hybrid parallelism tech-niques. A novel contribution of this work is the ability to deal with miss-ing values. Even t...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Exploiting Parallelism in Decision Tree Induction. In Parallel and Distributed computing for Machine Learning

Abstract

Extracted data

Exploiting Parallelism in Decision Tree Induction. In Parallel and Distributed computing for Machine Learning

Abstract

Extracted data

Related items

Related items