In this paper, a methodological data condensation approach for reducing tabular big datasets in classification problems is presented, named FDR2-BD. The key of our proposal is to analyze data in a dual way (vertical and horizontal), so as to provide a smart combination between feature selection to generate dense clusters of data and uniform sampling reduction to keep only a few representative samples from each problem area. Its main advantage is allowing the model’s predictive quality to be kept in a range determined by a user’s threshold. Its robustness is built on a hyper-parametrization process, in which all data are taken into consideration by following a k-fold procedure. Another significant capability is being fast and scalable by usi...
Advancements in information technology is contributing to the excessive rate of big data generation ...
Analytics of Big data research has been entering the latest processes of "fast-data", in which every...
Extraordinary amounts of data are being produced in many branches of science. Proven statistical met...
In this paper, a methodological data condensation approach for reducing tabular big datasets in clas...
In this paper, a methodological data condensation approach for reducing tabular big datasets in cla...
In this paper, a methodological data condensation approach for reducing tabular big datasets in clas...
Lately, Big Data (BD) classification has become an active research area in different fields namely f...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
In the era of big data, analyzing and extracting knowledge from large-scale data sets is a very inte...
Abstract—This work has two main objectives, namely, to introduce a novel algorithm, called the Fast ...
The addition of knowledge and data has increased exponentially in the last decade or so. Previously ...
Discretization of numerical data is one of the most influential data preprocessing tasks in knowledg...
Big data comes in various ways, types, shapes, forms and sizes. Indeed, almost all areas of science,...
Advancements in information technology is contributing to the excessive rate of big data generation ...
Analytics of Big data research has been entering the latest processes of "fast-data", in which every...
Extraordinary amounts of data are being produced in many branches of science. Proven statistical met...
In this paper, a methodological data condensation approach for reducing tabular big datasets in clas...
In this paper, a methodological data condensation approach for reducing tabular big datasets in cla...
In this paper, a methodological data condensation approach for reducing tabular big datasets in clas...
Lately, Big Data (BD) classification has become an active research area in different fields namely f...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
In the era of big data, analyzing and extracting knowledge from large-scale data sets is a very inte...
Abstract—This work has two main objectives, namely, to introduce a novel algorithm, called the Fast ...
The addition of knowledge and data has increased exponentially in the last decade or so. Previously ...
Discretization of numerical data is one of the most influential data preprocessing tasks in knowledg...
Big data comes in various ways, types, shapes, forms and sizes. Indeed, almost all areas of science,...
Advancements in information technology is contributing to the excessive rate of big data generation ...
Analytics of Big data research has been entering the latest processes of "fast-data", in which every...
Extraordinary amounts of data are being produced in many branches of science. Proven statistical met...