Ensemble methods using the same underlying algorithm trained on different subsets of observations have recently received increased attention as practical prediction tools for massive data sets. We propose Subsemble, a general subset ensemble prediction method, which can be used for small, moderate, or large data sets. Subsemble partitions the full data set into subsets of observations, fits one or more user-specified underlying algorithm on each subset, and uses a clever form of V-fold cross-validation to output a prediction function that combines the subset-specific fits through a second user-specified metalearner algorithm. We give an oracle result that provides a theoretical performance guarantee for Subsemble. Through simulations, w...
This paper introduces a new ensemble approach, Feature-Subspace Aggregating (Feating), which builds ...
We present a series of learning algorithms and theoretical guarantees for designing accurate en-semb...
We present a series of learning algorithms and theoretical guarantees for designing accurate en-semb...
Ensemble methods using the same underlying algorithm trained on different subsets of observations ha...
Subsemble is a flexible ensemble method that partitions a full data set into subsets of observations...
Ensemble methods using the same underlying algorithm trained on different subsets of observations ha...
Subsemble is a general subset ensemble prediction method, which can be used for small, moderate, or ...
2011-07-29In this dissertation, we study the subset selection problem for prediction. It deals with ...
International audienceLimited area ensemble predictions can be sensitive to the specification of lat...
Sparse and ensemble methods are the two main approaches in the statistical literature for modeling h...
The motivation of this work is to improve the performance of standard stacking approaches or ensembl...
Abstract : The objective is to provide methods to improve the performance, or prediction accuracy of...
We propose a new regression algorithm that learns from a set of input-output pairs. Our algorithm is...
Ensemble machine learning methods are often used when the true prediction function is not easily app...
International audienceThe success of machine learning (ML) systems depends on data availability, vol...
This paper introduces a new ensemble approach, Feature-Subspace Aggregating (Feating), which builds ...
We present a series of learning algorithms and theoretical guarantees for designing accurate en-semb...
We present a series of learning algorithms and theoretical guarantees for designing accurate en-semb...
Ensemble methods using the same underlying algorithm trained on different subsets of observations ha...
Subsemble is a flexible ensemble method that partitions a full data set into subsets of observations...
Ensemble methods using the same underlying algorithm trained on different subsets of observations ha...
Subsemble is a general subset ensemble prediction method, which can be used for small, moderate, or ...
2011-07-29In this dissertation, we study the subset selection problem for prediction. It deals with ...
International audienceLimited area ensemble predictions can be sensitive to the specification of lat...
Sparse and ensemble methods are the two main approaches in the statistical literature for modeling h...
The motivation of this work is to improve the performance of standard stacking approaches or ensembl...
Abstract : The objective is to provide methods to improve the performance, or prediction accuracy of...
We propose a new regression algorithm that learns from a set of input-output pairs. Our algorithm is...
Ensemble machine learning methods are often used when the true prediction function is not easily app...
International audienceThe success of machine learning (ML) systems depends on data availability, vol...
This paper introduces a new ensemble approach, Feature-Subspace Aggregating (Feating), which builds ...
We present a series of learning algorithms and theoretical guarantees for designing accurate en-semb...
We present a series of learning algorithms and theoretical guarantees for designing accurate en-semb...