Abstract — In this paper we show that the normalized compression distance can be applied to gene expression data analysis. Typically, microarray-based classification involves using a feature subset selection method in connection with a specific distance metric. The performance is dependent on the selection of the methods. With our proposed approach there is no need for feature subset or distance metric selection and all the data can be used directly with the universal similarity metric. We demonstrate the method on simulated and real microarray data. I
Similarity measurement is one of the most important stages in the process of cancer discovery from g...
Classifying, clustering or building a phylogeny on a set of genomes without the expensive computatio...
Clustering genes into groups that exhibit similar expression patterns is one of the most fundamental...
In this project, we target to find effective and unsupervised feature reduction tools for gene expre...
Genomic sequences are usually compared using evolutionary distance, a procedure that implies the ali...
Background: Clustering is crucial for gene expression data analysis. As an unsupervised exploratory ...
Abstract Background The most fundamental task using gene expression data in clinical oncology is to ...
Background The search for cluster structure in microarray datasets is a base problem for the so-cal...
Normalization before clustering is often needed for proximity indices, such as Euclidian distance, w...
In this paper, we explore a representation methodology for the compression of DNA isolates. Using lo...
We present a new method for clustering based on compression. The method doesn't use subject-spe...
Recent advances in molecular biology and biotechnology have made it possible to mon-itor the activit...
Background: Clustering is crucial for gene expression data analysis. As an unsupervised explorator...
Genomic sequences are usually compared using evolutionary distance, a procedure that implies the al...
Motivation: Genome-wide gene expression measurements, as currently determined by the microarray tech...
Similarity measurement is one of the most important stages in the process of cancer discovery from g...
Classifying, clustering or building a phylogeny on a set of genomes without the expensive computatio...
Clustering genes into groups that exhibit similar expression patterns is one of the most fundamental...
In this project, we target to find effective and unsupervised feature reduction tools for gene expre...
Genomic sequences are usually compared using evolutionary distance, a procedure that implies the ali...
Background: Clustering is crucial for gene expression data analysis. As an unsupervised exploratory ...
Abstract Background The most fundamental task using gene expression data in clinical oncology is to ...
Background The search for cluster structure in microarray datasets is a base problem for the so-cal...
Normalization before clustering is often needed for proximity indices, such as Euclidian distance, w...
In this paper, we explore a representation methodology for the compression of DNA isolates. Using lo...
We present a new method for clustering based on compression. The method doesn't use subject-spe...
Recent advances in molecular biology and biotechnology have made it possible to mon-itor the activit...
Background: Clustering is crucial for gene expression data analysis. As an unsupervised explorator...
Genomic sequences are usually compared using evolutionary distance, a procedure that implies the al...
Motivation: Genome-wide gene expression measurements, as currently determined by the microarray tech...
Similarity measurement is one of the most important stages in the process of cancer discovery from g...
Classifying, clustering or building a phylogeny on a set of genomes without the expensive computatio...
Clustering genes into groups that exhibit similar expression patterns is one of the most fundamental...