Determining whether two cells have the same genotype is a key problem in forensic deoxyribonucleic acid (DNA) analysis using electropherograms (EPGs). Single-cell extraction of DNA has become more practical in recent years and holds promise in reducing the complexity of downstream data interpretation. Clustering of EPG data is a potential way to aid downstream interpretation. We explore many clustering techniques. The clustering methods that meet certain criteria are selected for further experiments with combinations of sample types and parameters including two proposed per-locus feature transformations. The best performing clusterers form cluster ensembles for further testing. Clustering performances are measured and compared to the baseli...
In this work we seek clusters of genomic words in human DNA by studying their inter-word lag distrib...
A clustering technique needs to produce high-quality clusters from biomedical and...
Several methods exist to detect shared genetic ancestry or to identify population substructure using...
Cells can be linked to the person who produced them by examining the information contained within t...
Current analysis of forensic DNA stains relies on the probabilistic interpretation of bulk-processed...
Affymetrix high-density oligonucleotide microarrays measure expression of DNA transcripts using prob...
DNA is now routinely used in criminal investigations and court cases, although DNA samples taken at ...
We apply hierarchical clustering (HC) of DNA k-mer counts on multiple Fastq files. The tree structur...
The process by which DNA is transformed into gene products, such as RNA and proteins, is called gene...
As next-generation sequencing (NGS) becomes the dominant technology for studying the gene expr...
Motivation: Accurately clustering cell types from a mass of heterogeneous cells is a crucial first s...
Clustering techniques are used to arrange genes in some natural way, that is, to organize genes into...
DNA data are huge and multidimensional, containing simultaneous gene expression, and it uses t...
RNA-Seq is becoming the standard technology for large-scale gene expression level measurements, as i...
Dealing with data means grouping information into a set of categories, either in order to le...
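One of the abstracts above describes hierarchical clustering (HC) of DNA k-mer counts. As a rough illustration of that general idea, and not the authors' pipeline, the sketch below counts k-mers in a few toy sequences and merges samples by average linkage. The function names (`kmer_profile`, `euclidean`, `average_linkage`), the toy sequences, the choice of k = 3, Euclidean distance, and average linkage are all assumptions made for this example. FASTQ parsing is omitted.

```python
from collections import Counter
from itertools import combinations
import math


def kmer_profile(sequence, k):
    """Return normalized k-mer frequencies for one DNA sequence (hypothetical helper)."""
    counts = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()}


def euclidean(p, q):
    """Euclidean distance between two sparse frequency profiles."""
    keys = set(p) | set(q)
    return math.sqrt(sum((p.get(key, 0.0) - q.get(key, 0.0)) ** 2 for key in keys))


def average_linkage(profiles, n_clusters):
    """Agglomerative clustering with average linkage; returns frozensets of sample names."""
    names = list(profiles)
    # Precompute all pairwise distances between individual samples.
    dist = {frozenset(pair): euclidean(profiles[pair[0]], profiles[pair[1]])
            for pair in combinations(names, 2)}
    clusters = [frozenset([name]) for name in names]

    def linkage(a, b):
        # Mean pairwise distance between members of the two clusters.
        return sum(dist[frozenset((x, y))] for x in a for y in b) / (len(a) * len(b))

    # Repeatedly merge the two closest clusters until n_clusters remain.
    while len(clusters) > n_clusters:
        a, b = min(combinations(clusters, 2), key=lambda pair: linkage(*pair))
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return clusters


# Toy sequences standing in for reads from four samples (assumed data).
samples = {
    "s1": "ACGTACGTACGTACGT",
    "s2": "ACGTACGTACGAACGT",
    "s3": "GGGCCCGGGCCCGGGC",
    "s4": "GGGCCCGGGCCAGGGC",
}
profiles = {name: kmer_profile(seq, k=3) for name, seq in samples.items()}
clusters = average_linkage(profiles, n_clusters=2)
```

On this toy input the two ACGT-rich samples end up in one cluster and the two GC-rich samples in the other. For real data, a library implementation such as `scipy.cluster.hierarchy.linkage` would likely be a better fit than this quadratic pure-Python loop.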