Background: Clustering of gene expression data is widely used to identify novel subtypes of cancer. Plenty of clustering approaches have been proposed, but there is a lack of knowledge regarding their relative merits and how data characteristics influence the performance. We evaluate how cluster analysis choices affect the performance by studying four publicly available human cancer data sets: breast, brain, kidney and stomach cancer. In particular, we focus on how the sample size, distribution of subtypes and sample heterogeneity affect the performance. Results: In general, increasing the sample size had limited effect on the clustering performance, e.g. for the breast cancer data similar performance was obtained for n = 40 as for n = 330....
In cancer research, class discovery is the first process for investigating a new dataset for which h...
In cancer research, class discovery is the first process for investigating a new dataset for which h...
In cancer research, class discovery is the first process for investigating a new dataset for which h...
Background: Clustering of gene expression data is widely used to identify novel subtypes of cancer. ...
Background: Clustering of gene expression data is widely used to identify novel subtypes of cancer. ...
BACKGROUND:Clustering of gene expression data is widely used to identify novel subtypes of cancer. P...
RNA-Seq is becoming the standard technology for large-scale gene expression level measurements, as i...
Background The use of clustering methods for the discovery of cancer subtypes has drawn a great dea...
Background The use of clustering methods for the discovery of cancer subtypes has drawn a great deal...
Identification and prediction of cancer subtypes are important parts in the development towards pers...
Identification and prediction of cancer subtypes are important parts in the development towards pers...
Identification and prediction of cancer subtypes are important parts in the development towards pers...
Background: Gene expression microarray studies for several types of cancer have been reported to ide...
Clustering algorithms are extensively used on patient tissue samples in order to group and visualize...
Background The use of clustering methods for the discovery of cancer subtypes has drawn a great deal...
In cancer research, class discovery is the first process for investigating a new dataset for which h...
In cancer research, class discovery is the first process for investigating a new dataset for which h...
In cancer research, class discovery is the first process for investigating a new dataset for which h...
Background: Clustering of gene expression data is widely used to identify novel subtypes of cancer. ...
Background: Clustering of gene expression data is widely used to identify novel subtypes of cancer. ...
BACKGROUND:Clustering of gene expression data is widely used to identify novel subtypes of cancer. P...
RNA-Seq is becoming the standard technology for large-scale gene expression level measurements, as i...
Background The use of clustering methods for the discovery of cancer subtypes has drawn a great dea...
Background The use of clustering methods for the discovery of cancer subtypes has drawn a great deal...
Identification and prediction of cancer subtypes are important parts in the development towards pers...
Identification and prediction of cancer subtypes are important parts in the development towards pers...
Identification and prediction of cancer subtypes are important parts in the development towards pers...
Background: Gene expression microarray studies for several types of cancer have been reported to ide...
Clustering algorithms are extensively used on patient tissue samples in order to group and visualize...
Background The use of clustering methods for the discovery of cancer subtypes has drawn a great deal...
In cancer research, class discovery is the first process for investigating a new dataset for which h...
In cancer research, class discovery is the first process for investigating a new dataset for which h...
In cancer research, class discovery is the first process for investigating a new dataset for which h...