Real-world datasets, such as genomic data, are noisy and high-dimensional, and are therefore difficult to analyse without a preliminary step aimed to reduce data dimensionality and to select relevant features. Projection techniques are a useful tool to pre-process high dimensional datd since they allow to achieve a simpler representation of the original data that still preserves intrinsic information. In this work, we assess the effectiveness of these methods when applied to two common tasks in Bioinformatics: patient classification and gene clustering. We compared the performance of different learning models in the original space and in several projected spaces obtained with different techniques, both in a supervised and in an unsupervised...
Abstract. Protein mass spectrometry (MS) pattern recognition has recently emerged as a new method fo...
The scale of genetic, epigenomic, transcriptomic, cheminformatic and proteomic data available today,...
Genotype data, consisting large numbers of markers, is used as demographic and association studies t...
Real-world datasets, such as genomic data, are noisy and high-dimensional, and are therefore difficu...
We present a novel method for finding low dimensional views of high dimensional data: Targeted Proje...
The microarray DNA technologies have given researchers the ability to examine, discover and monitor ...
In this paper we present a GP-based method for automatically evolve projections, so that data can be...
In this paper we present a GP-based method for automatically evolve projections, so that data can be...
Abstract Background With the advance of microarray technology, several methods for gene classificati...
Clustering is one of the most well known activities in scientific investigation and the object of re...
Depuis le premier séquençage du génome humain au début des années 2000, de grandes initiatives se so...
The analysis of gene expression data involves the observation of a very large number of variables (g...
This thesis studies exploratory cluster analysis of genomic high-throughput data sets and their inte...
Dimensionality reduction is a compelling alternative for high-dimensional data visualization. This m...
This research evaluates pattern recognition techniques on a subclass of big data where the dimension...
Abstract. Protein mass spectrometry (MS) pattern recognition has recently emerged as a new method fo...
The scale of genetic, epigenomic, transcriptomic, cheminformatic and proteomic data available today,...
Genotype data, consisting large numbers of markers, is used as demographic and association studies t...
Real-world datasets, such as genomic data, are noisy and high-dimensional, and are therefore difficu...
We present a novel method for finding low dimensional views of high dimensional data: Targeted Proje...
The microarray DNA technologies have given researchers the ability to examine, discover and monitor ...
In this paper we present a GP-based method for automatically evolve projections, so that data can be...
In this paper we present a GP-based method for automatically evolve projections, so that data can be...
Abstract Background With the advance of microarray technology, several methods for gene classificati...
Clustering is one of the most well known activities in scientific investigation and the object of re...
Depuis le premier séquençage du génome humain au début des années 2000, de grandes initiatives se so...
The analysis of gene expression data involves the observation of a very large number of variables (g...
This thesis studies exploratory cluster analysis of genomic high-throughput data sets and their inte...
Dimensionality reduction is a compelling alternative for high-dimensional data visualization. This m...
This research evaluates pattern recognition techniques on a subclass of big data where the dimension...
Abstract. Protein mass spectrometry (MS) pattern recognition has recently emerged as a new method fo...
The scale of genetic, epigenomic, transcriptomic, cheminformatic and proteomic data available today,...
Genotype data, consisting large numbers of markers, is used as demographic and association studies t...