We describe an ensemble approach to learning salient spatial regions from arbitrarily partitioned simulation data. Ensemble approaches for anomaly detection are also explored. The partitioning comes from the distributed processing requirements of large-scale simulations. The volume of the data is such that classifiers can train only on data local to a given partition. Since the data partition reflects the needs of the simulation, the class statistics can vary from partition to partition. Some classes will likely be missing from some or even most partitions. We combine a fast ensemble learning algorithm with scaled probabilistic majority voting in order to learn an accurate classifier from such data. Since some simulations are difficult to m...
In real world situations every model has some weaknesses and will make errors on training data. Give...
This paper proposes a novel approach for improving the accuracy of statistical prediction methods in...
To improve the predictive power of classifiers against imbalanced data sets, this paper presents an ...
We describe an ensemble approach to learning salient spatial regions from arbitrarily partitioned si...
Many simulation data sets are so massive that they must be distributed among disk farms attached to ...
Committees of classifiers, also called mixtures or ensembles of classifiers, have become popular bec...
Many real-life problems can be described as unbalanced, where the number of instances belonging to o...
Ensembles are often capable of greater prediction accuracy than any of their individual members. As ...
This dissertation explores Machine Learning in the context of computationally intensive simulations....
Classification is one of the critical task in datamining. Many classifiers exist for classification ...
Clustering based ensemble classifiers have seen a lot of focus recently because of their ability to ...
Oversampling method is one of the effective ways to deal with imbalance classification problems. Thi...
This dissertation is about classification methods and class probability prediction. It can be roughl...
Outlier detection and ensemble learning are well established research directions in data mining yet ...
Clustering based ensemble of classifiers have shown a significant improvement in classification accu...
In real world situations every model has some weaknesses and will make errors on training data. Give...
This paper proposes a novel approach for improving the accuracy of statistical prediction methods in...
To improve the predictive power of classifiers against imbalanced data sets, this paper presents an ...
We describe an ensemble approach to learning salient spatial regions from arbitrarily partitioned si...
Many simulation data sets are so massive that they must be distributed among disk farms attached to ...
Committees of classifiers, also called mixtures or ensembles of classifiers, have become popular bec...
Many real-life problems can be described as unbalanced, where the number of instances belonging to o...
Ensembles are often capable of greater prediction accuracy than any of their individual members. As ...
This dissertation explores Machine Learning in the context of computationally intensive simulations....
Classification is one of the critical task in datamining. Many classifiers exist for classification ...
Clustering based ensemble classifiers have seen a lot of focus recently because of their ability to ...
Oversampling method is one of the effective ways to deal with imbalance classification problems. Thi...
This dissertation is about classification methods and class probability prediction. It can be roughl...
Outlier detection and ensemble learning are well established research directions in data mining yet ...
Clustering based ensemble of classifiers have shown a significant improvement in classification accu...
In real world situations every model has some weaknesses and will make errors on training data. Give...
This paper proposes a novel approach for improving the accuracy of statistical prediction methods in...
To improve the predictive power of classifiers against imbalanced data sets, this paper presents an ...