Feature selection targets the identification of which features of a dataset are relevant to the learning task. It is also widely known and used to improve computation times, reduce computation requirements, and to decrease the impact of the curse of dimensionality and enhancing the generalization rates of classifiers. In data streams, classifiers shall benefit from all the items above, but more importantly, from the fact that the relevant subset of features may drift over time. In this paper, we propose a novel dynamic feature selection method for data streams called Adaptive Boosting for Feature Selection (ABFS). ABFS chains decision stumps and drift detectors, and as a result, identifies which features are relevant to the learning task as...
The term “data-drift” refers to a difference between the data used to test and validate a model and ...
As dimensions of datasets in predictive modelling continue to grow, feature selection becomes increa...
The rise of network connected devices and applications leads to a significant increase in the volume...
Feature selection targets the identification of which features of a dataset are relevant to the lear...
Data stream mining is a fast growing research topic due to the ubiquity of data in several real-worl...
The ubiquity of data streams has been encouraging the development of new incremental and adaptive le...
Traditional active learning tries to identify instances for which the acquisition of the label incre...
Orientador: André Leon Sampaio GradvohlDissertação (mestrado) - Universidade Estadual de Campinas, F...
Advances in hardware and software in the past decade allow to capture, record and process fast data ...
This paper proposes a boosting-like method to train a classifier ensemble from data streams. It natu...
We propose a new online feature selection framework for applications with streaming features where t...
The training set consists of many features that influence the classifier in different degrees. Choos...
Most data stream classification techniques assume that the underlying feature space is static. Howev...
Data streams are transmitted at high speeds with huge volume and may contain critical information ne...
Data streams are unbounded, sequential data instances that are generated with high Velocity. Data s...
The term “data-drift” refers to a difference between the data used to test and validate a model and ...
As dimensions of datasets in predictive modelling continue to grow, feature selection becomes increa...
The rise of network connected devices and applications leads to a significant increase in the volume...
Feature selection targets the identification of which features of a dataset are relevant to the lear...
Data stream mining is a fast growing research topic due to the ubiquity of data in several real-worl...
The ubiquity of data streams has been encouraging the development of new incremental and adaptive le...
Traditional active learning tries to identify instances for which the acquisition of the label incre...
Orientador: André Leon Sampaio GradvohlDissertação (mestrado) - Universidade Estadual de Campinas, F...
Advances in hardware and software in the past decade allow to capture, record and process fast data ...
This paper proposes a boosting-like method to train a classifier ensemble from data streams. It natu...
We propose a new online feature selection framework for applications with streaming features where t...
The training set consists of many features that influence the classifier in different degrees. Choos...
Most data stream classification techniques assume that the underlying feature space is static. Howev...
Data streams are transmitted at high speeds with huge volume and may contain critical information ne...
Data streams are unbounded, sequential data instances that are generated with high Velocity. Data s...
The term “data-drift” refers to a difference between the data used to test and validate a model and ...
As dimensions of datasets in predictive modelling continue to grow, feature selection becomes increa...
The rise of network connected devices and applications leads to a significant increase in the volume...