In this paper, we address the challenging task of learning accurate classifiers from micro-array datasets involving a large number of features but only a small number of samples. We present a greedy step-by-step procedure (SSFS) that can be used to reduce the dimensionality of the feature space. We apply the Minimum Description Length principle to the training data for weighting each feature and then select an “optimal” feature subset by a greedy approach tuned to a specific classifier. The Acute Lymphoblastic Leukemia dataset is used to evaluate the effectiveness of the SSFS procedure in conjunction with different state-of-the-art classification algorithms
The microarrays report the measures of the expression levels of tens of thousands of genes, this hig...
Microarray dataset dimensionality reduction is a prerequisite for avoiding overfitting, and hence de...
This research evaluates pattern recognition techniques on a subclass of big data where the dimension...
In this paper, we address the challenging task of learning accurate classifiers from micro-array dat...
This paper reports on three machine learning methods, i.e. Naïve Bayes (NB), Adaptive Bayesian Netw...
In numerous classification problems, the number of available samples to be used in the classifier tr...
Feature selection approach solves the dimensionality problem by removing irrelevant and redundant fe...
Classification of micro-array data has been studied extensively but only a small amount of research ...
Accurate classification of DNA microarray data is vital for cancer diagnosis and treatment. For grea...
Abstract Background Data generated using 'omics' technologies are characterized by high dimensionali...
Supervised learning techniques designed for the situation when the dimensionality exceeds the sample...
In many technological or industrial fields, the amount of high dimensional data is steadily growing....
A large pool of techniques have already been developed for analyzing micro-array datasets but less ...
Computing methods that allow the efficient and accurate processing of experimentally gathered data p...
When dealing with biomedical data, the first and most challenging issue is often the huge dimensiona...
The microarrays report the measures of the expression levels of tens of thousands of genes, this hig...
Microarray dataset dimensionality reduction is a prerequisite for avoiding overfitting, and hence de...
This research evaluates pattern recognition techniques on a subclass of big data where the dimension...
In this paper, we address the challenging task of learning accurate classifiers from micro-array dat...
This paper reports on three machine learning methods, i.e. Naïve Bayes (NB), Adaptive Bayesian Netw...
In numerous classification problems, the number of available samples to be used in the classifier tr...
Feature selection approach solves the dimensionality problem by removing irrelevant and redundant fe...
Classification of micro-array data has been studied extensively but only a small amount of research ...
Accurate classification of DNA microarray data is vital for cancer diagnosis and treatment. For grea...
Abstract Background Data generated using 'omics' technologies are characterized by high dimensionali...
Supervised learning techniques designed for the situation when the dimensionality exceeds the sample...
In many technological or industrial fields, the amount of high dimensional data is steadily growing....
A large pool of techniques have already been developed for analyzing micro-array datasets but less ...
Computing methods that allow the efficient and accurate processing of experimentally gathered data p...
When dealing with biomedical data, the first and most challenging issue is often the huge dimensiona...
The microarrays report the measures of the expression levels of tens of thousands of genes, this hig...
Microarray dataset dimensionality reduction is a prerequisite for avoiding overfitting, and hence de...
This research evaluates pattern recognition techniques on a subclass of big data where the dimension...