A complexity based pruning procedure for classification trees is described, and bounds on its finite sample performance are established. The procedure selects a subtree of a (possibly random) initial tree in order to minimize a complexity penalized measure of empirical risk. The complexity assigned to a subtree is proportional to the square root of its size. Two cases are considered. In the first the growing and pruning data sets are identical, and in the second they are independent. Using the performance bound, the Bayes risk consistency of pruned trees obtained via the procedure is established when the sequence of initial trees satisfies suitable geometric and structural constraints. The pruning method and its analysis are motivated by wo...
12 pagesInternational audienceThe performance of the Classification And Regression Trees (CART) prun...
Pruning is one of the key procedures in training decision tree classifiers. It removes trivial rules...
decision tree classifiers in two learning situations: minimizing loss and probability estimation. In...
A complexity based pruning procedure for classification trees is described, and bounds on its finite...
Cost complexity pruning of classification trees as introduced in the Classification and Regression T...
This paper provides a new pruning method for classification trees based on the impurity-complexity m...
Decision tree pruning is critical for the construction of good decision trees. The most popular and ...
Probability trees (or Probability Estimation Trees, PET's) are decision trees with probability distr...
Abstract Probability trees are decision trees that predict class probabilities rather than the most ...
Growing amount of high dimensional data requires robust analysis techniques. Tree-based ensemble met...
Various factors aecting decision tree learning time are explored. The factors which consistently aec...
The final publication is available at Springer via http://dx.doi.org/10.1007/11875581_119Proceedings...
Top-down induction of decision trees has been observed to suer from the inadequate functioning of th...
The pruning phase is one of the necessary steps in decision tree induction. Existing pruning algorit...
By identifying relationships between regression tree construction and change-point detection, we sho...
12 pagesInternational audienceThe performance of the Classification And Regression Trees (CART) prun...
Pruning is one of the key procedures in training decision tree classifiers. It removes trivial rules...
decision tree classifiers in two learning situations: minimizing loss and probability estimation. In...
A complexity based pruning procedure for classification trees is described, and bounds on its finite...
Cost complexity pruning of classification trees as introduced in the Classification and Regression T...
This paper provides a new pruning method for classification trees based on the impurity-complexity m...
Decision tree pruning is critical for the construction of good decision trees. The most popular and ...
Probability trees (or Probability Estimation Trees, PET's) are decision trees with probability distr...
Abstract Probability trees are decision trees that predict class probabilities rather than the most ...
Growing amount of high dimensional data requires robust analysis techniques. Tree-based ensemble met...
Various factors aecting decision tree learning time are explored. The factors which consistently aec...
The final publication is available at Springer via http://dx.doi.org/10.1007/11875581_119Proceedings...
Top-down induction of decision trees has been observed to suer from the inadequate functioning of th...
The pruning phase is one of the necessary steps in decision tree induction. Existing pruning algorit...
By identifying relationships between regression tree construction and change-point detection, we sho...
12 pagesInternational audienceThe performance of the Classification And Regression Trees (CART) prun...
Pruning is one of the key procedures in training decision tree classifiers. It removes trivial rules...
decision tree classifiers in two learning situations: minimizing loss and probability estimation. In...