In this paper, a methodological data condensation approach for reducing tabular big datasets in classification problems is presented, named FDR²-BD. The key of our proposal is to analyze data in a dual way (vertical and horizontal), so as to provide a smart combination between feature selection to generate dense clusters of data and uniform sampling reduction to keep only a few representative samples from each problem area. Its main advantage is allowing the model’s predictive quality to be kept in a range determined by a user’s threshold. Its robustness is built on a hyper-parametrization process, in which all data are taken into consideration by following a k-fold procedure. Another significant capability is being fast and scalable by usi...
Extraordinary amounts of data are being produced in many branches of science. Proven statistical met...
Analytics of Big data research has been entering the latest processes of "fast-data", in which every...
Abstract- Clustering is the unsupervised classification of patterns (data items) into groups (cluste...
In this paper, a methodological data condensation approach for reducing tabular big datasets in clas...
In this paper, a methodological data condensation approach for reducing tabular big datasets in cla...
In this paper, a methodological data condensation approach for reducing tabular big datasets in clas...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
Lately, Big Data (BD) classification has become an active research area in different fields namely f...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
The addition of knowledge and data has increased exponentially in the last decade or so. Previously ...
Big data comes in various ways, types, shapes, forms and sizes. Indeed, almost all areas of science,...
Discretization of numerical data is one of the most influential data preprocessing tasks in knowledg...
Advancements in information technology is contributing to the excessive rate of big data generation ...
In the era of big data, analyzing and extracting knowledge from large-scale data sets is a very inte...
Extraordinary amounts of data are being produced in many branches of science. Proven statistical met...
Analytics of Big data research has been entering the latest processes of "fast-data", in which every...
Abstract- Clustering is the unsupervised classification of patterns (data items) into groups (cluste...
In this paper, a methodological data condensation approach for reducing tabular big datasets in clas...
In this paper, a methodological data condensation approach for reducing tabular big datasets in cla...
In this paper, a methodological data condensation approach for reducing tabular big datasets in clas...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
Lately, Big Data (BD) classification has become an active research area in different fields namely f...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
The big data concept has elicited studies on how to accurately and efficiently extract valuable info...
The addition of knowledge and data has increased exponentially in the last decade or so. Previously ...
Big data comes in various ways, types, shapes, forms and sizes. Indeed, almost all areas of science,...
Discretization of numerical data is one of the most influential data preprocessing tasks in knowledg...
Advancements in information technology is contributing to the excessive rate of big data generation ...
In the era of big data, analyzing and extracting knowledge from large-scale data sets is a very inte...
Extraordinary amounts of data are being produced in many branches of science. Proven statistical met...
Analytics of Big data research has been entering the latest processes of "fast-data", in which every...
Abstract- Clustering is the unsupervised classification of patterns (data items) into groups (cluste...