We apply hierarchical clustering (HC) of DNA k-mer counts on multiple Fastq files. The tree structures produced by HC may reflect experimental groups and thereby indicate experimental effects, but clustering of preparation groups indicates the presence of batch effects. Hence, HC of DNA k-mer counts may serve as a diagnostic device. In order to provide a simple applicable tool we implemented sequential analysis of Fastq reads with low memory usage in an R package (seqTools) available on Bioconductor. The approach is validated by analysis of Fastq file batches containing RNAseq data. Analysis of three Fastq batches downloaded from ArrayExpress indicated experimental effects. Analysis of RNAseq data from two cell types (dermal fibroblasts and...
Recently, microarray technologies have become a robust technique in the area of genomics. An importa...
Sample- and gene- based hierarchical cluster analyses have been widely adopted as tools for explorin...
Background: The commercially available 10x Genomics protocol to generate droplet-based single cell R...
The process by which DNA is transformed into gene products, such as RNA and proteins, is called gene...
Background: The commercially available 10x Genomics protocol to generate droplet-based single cell R...
Genomic sequences can be viewed as special types of documents. These are typically organised and sto...
RNA-Seq is becoming the standard technology for large-scale gene expression level measurements, as i...
High-throughput sequencing (HTS) refers to the simultaneous sequencing of millions of fragments of D...
In cancer research, class discovery is the first process for investigating a new dataset for which h...
Background: Clustering of gene expression data is widely used to identify novel subtypes of cancer. ...
Single-cell RNA-seq (scRNAseq) is a powerful tool to study heterogeneity of cells. Recently, several...
<p>Hierarchical clustering of the 2414 contigs among 3 groups (from left to right: resistant, subcli...
When applying hierarchical clustering algorithms to cluster patient samples from microarray data, th...
As the next generation sequencing (NGS) becomes the dominating technology for studying the gene expr...
BACKGROUND:Clustering of gene expression data is widely used to identify novel subtypes of cancer. P...
Recently, microarray technologies have become a robust technique in the area of genomics. An importa...
Sample- and gene- based hierarchical cluster analyses have been widely adopted as tools for explorin...
Background: The commercially available 10x Genomics protocol to generate droplet-based single cell R...
The process by which DNA is transformed into gene products, such as RNA and proteins, is called gene...
Background: The commercially available 10x Genomics protocol to generate droplet-based single cell R...
Genomic sequences can be viewed as special types of documents. These are typically organised and sto...
RNA-Seq is becoming the standard technology for large-scale gene expression level measurements, as i...
High-throughput sequencing (HTS) refers to the simultaneous sequencing of millions of fragments of D...
In cancer research, class discovery is the first process for investigating a new dataset for which h...
Background: Clustering of gene expression data is widely used to identify novel subtypes of cancer. ...
Single-cell RNA-seq (scRNAseq) is a powerful tool to study heterogeneity of cells. Recently, several...
<p>Hierarchical clustering of the 2414 contigs among 3 groups (from left to right: resistant, subcli...
When applying hierarchical clustering algorithms to cluster patient samples from microarray data, th...
As the next generation sequencing (NGS) becomes the dominating technology for studying the gene expr...
BACKGROUND:Clustering of gene expression data is widely used to identify novel subtypes of cancer. P...
Recently, microarray technologies have become a robust technique in the area of genomics. An importa...
Sample- and gene- based hierarchical cluster analyses have been widely adopted as tools for explorin...
Background: The commercially available 10x Genomics protocol to generate droplet-based single cell R...