We describe an ensemble approach to learning salient spatial regions from arbitrarily partitioned simulation data. Ensemble approaches for anomaly detection are also explored. The partitioning comes from the distributed processing requirements of large-scale simulations. The volume of the data is such that classifiers can train only on data local to a given partition. Since the data partition reflects the needs of the simulation, the class statistics can vary from partition to partition. Some classes will likely be missing from some or even most partitions. We combine a fast ensemble learning algorithm with scaled probabilistic majority voting in order to learn an accurate classifier from such data. Since some simulations are difficult to m...
There is an unprecedented amount of data available. This has caused knowledge discovery to garner at...
Anomaly detection has many applications in numerous areas such as intrusion detection, fraud detecti...
With the availability of high-speed Internet and the advent of Internet of Things devices, modern so...
We describe an ensemble approach to learning salient spatial regions from arbitrarily partitioned si...
Semi-supervised self-learning algorithms have been shown to improve classifier accuracy under a vari...
Many simulation data sets are so massive that they must be distributed among disk farms attached to ...
This dissertation explores Machine Learning in the context of computationally intensive simulations....
The class imbalance problem is prevalent in many domains including medical, natural language process...
Many real-life problems can be described as unbalanced, where the number of instances belonging to o...
In this thesis several sampling methods for Statistical Learning with imbalanced data have been impl...
Ensembles are often capable of greater prediction accuracy than any of their individual members. As ...
Many real world applications inevitably contain datasets that have multiclass structure characterize...
Classification methods usually exhibit a poor performance when they are applied on imbalanced data s...
We present an extension to the federated ensemble regression using classification algorithm, an ense...
[[abstract]]It is difficult for learning models to achieve high classification performances with imb...
There is an unprecedented amount of data available. This has caused knowledge discovery to garner at...
Anomaly detection has many applications in numerous areas such as intrusion detection, fraud detecti...
With the availability of high-speed Internet and the advent of Internet of Things devices, modern so...
We describe an ensemble approach to learning salient spatial regions from arbitrarily partitioned si...
Semi-supervised self-learning algorithms have been shown to improve classifier accuracy under a vari...
Many simulation data sets are so massive that they must be distributed among disk farms attached to ...
This dissertation explores Machine Learning in the context of computationally intensive simulations....
The class imbalance problem is prevalent in many domains including medical, natural language process...
Many real-life problems can be described as unbalanced, where the number of instances belonging to o...
In this thesis several sampling methods for Statistical Learning with imbalanced data have been impl...
Ensembles are often capable of greater prediction accuracy than any of their individual members. As ...
Many real world applications inevitably contain datasets that have multiclass structure characterize...
Classification methods usually exhibit a poor performance when they are applied on imbalanced data s...
We present an extension to the federated ensemble regression using classification algorithm, an ense...
[[abstract]]It is difficult for learning models to achieve high classification performances with imb...
There is an unprecedented amount of data available. This has caused knowledge discovery to garner at...
Anomaly detection has many applications in numerous areas such as intrusion detection, fraud detecti...
With the availability of high-speed Internet and the advent of Internet of Things devices, modern so...