203 pagesIn this thesis we aim to create a framework to quantify uncertainty of model predictions using the Infinitesimal Jackknife technique. We also aim to combine the principles of boosting and bagging to create higher quality predictions than those from currently used ensemble models. In the first part of the thesis we use boosting with random forests - which is a bagged estimators with decision trees as base learners. Focusing only on continuous responses which can be modelled by a Gaussian distribution, we see that this new model called "boosted forest" has much lower bias than the "base'' random forest. We also use the infinitesimal Jackknife to provide variance estimates for the boosted forest. These variance estimates are slightly ...
Decision trees and decision tree ensembles are widely used nonparametric statistical models. A decis...
We develop a Bayesian “sum-of-trees ” model where each tree is constrained by a prior to be a weak l...
Breiman (2001a,b) has recently developed an ensemble classification and regression approach that dis...
We study the variability of predictions made by bagged learners and random forests, and show how to ...
171 pagesMachine learning has become ubiquitous in many areas, including high-stake applications suc...
Ensemble models, such as bagging (Breiman, 1996), random forests (Breiman, 2001a), and boosting (Fre...
The error or variability of machine learning algorithms is often assessed by repeatedly refitting a ...
Despite the success of tree-¬based learning algorithms (bagging, boosting, random forests), these me...
This dissertation is about classification methods and class probability prediction. It can be roughl...
Recent expansions of technology led to growth and availability of different types of data. This, thu...
Understanding how "black-box" models arrive at their predictions has sparked significant interest fr...
Random forest and gradient boosting models are commonly found in publications using prediction model...
Statistical boosting is a powerful tool that has become increasingly more popular in recent years. I...
As data grows in size and complexity, scientists are relying more heavily on learning algorithms tha...
We present a statistical perspective on boosting. Special emphasis is given to estimating potentiall...
Decision trees and decision tree ensembles are widely used nonparametric statistical models. A decis...
We develop a Bayesian “sum-of-trees ” model where each tree is constrained by a prior to be a weak l...
Breiman (2001a,b) has recently developed an ensemble classification and regression approach that dis...
We study the variability of predictions made by bagged learners and random forests, and show how to ...
171 pagesMachine learning has become ubiquitous in many areas, including high-stake applications suc...
Ensemble models, such as bagging (Breiman, 1996), random forests (Breiman, 2001a), and boosting (Fre...
The error or variability of machine learning algorithms is often assessed by repeatedly refitting a ...
Despite the success of tree-¬based learning algorithms (bagging, boosting, random forests), these me...
This dissertation is about classification methods and class probability prediction. It can be roughl...
Recent expansions of technology led to growth and availability of different types of data. This, thu...
Understanding how "black-box" models arrive at their predictions has sparked significant interest fr...
Random forest and gradient boosting models are commonly found in publications using prediction model...
Statistical boosting is a powerful tool that has become increasingly more popular in recent years. I...
As data grows in size and complexity, scientists are relying more heavily on learning algorithms tha...
We present a statistical perspective on boosting. Special emphasis is given to estimating potentiall...
Decision trees and decision tree ensembles are widely used nonparametric statistical models. A decis...
We develop a Bayesian “sum-of-trees ” model where each tree is constrained by a prior to be a weak l...
Breiman (2001a,b) has recently developed an ensemble classification and regression approach that dis...