The rapidly increasing throughput of sequencing technologies allows us to sequence genomes, transcriptomes, and epigenomes at an unprecedented scale. Robust, efficient, and accurate computational methods to analyze sequence reads are crucial for successful large-scale studies. In this dissertation, I address specific computational and statistical challenges in quality assessment of sequence reads, ancestry-agnostic estimation of DNA sample contamination, and deconvolution of genetically multiplexed scRNA-seq sequence data by leveraging genetic variants. In Chapter 2, I describe rapid and accurate algorithms to produce comprehensive quality metrics directly from raw sequence reads without the requirement of full sequence alignment. To produc...
Background: Accurate calling of SNPs and genotypes from next-generation sequencing data is an essent...
Genetic sequencing has been recognized as an effective approach to accurately address biological pro...
Next-generation sequencing technology (NGS) enables the discovery of nearly all genetic variants pre...
The rapidly increasing throughput of sequencing technologies allows us to sequence genomes, transcri...
High-throughput genomic technologies offer powerful ways to identify genetic determinants of complex...
Genetic sequencing has been recognized as an effective approach to accurately address biological pro...
High-throughput sequencing enables basic and translational biology to query the mechanics of both li...
[Background] Contaminant DNA is a well-known confounding factor in molecular biology and in genomic ...
BACKGROUND: Contaminant DNA is a well-known confounding factor in molecular biology and in genomic r...
Short-read high-throughput sequencing is the most popular approach to collect massive amount of DNA ...
DNA sample contamination is a frequent problem in DNA sequencing studies and can result in genotypin...
Sequencing has revolutionized biology by permitting the analysis of genomic variation at an unpreced...
Detecting evidence of genetic engineering in the wild is a problem of growing importance for biosecu...
Reproducibility and robustness of genomic tools are two important factors to assess the reliability ...
Computational genomics involves the development and application of computational methods for whole-g...
Background: Accurate calling of SNPs and genotypes from next-generation sequencing data is an essent...
Genetic sequencing has been recognized as an effective approach to accurately address biological pro...
Next-generation sequencing technology (NGS) enables the discovery of nearly all genetic variants pre...
The rapidly increasing throughput of sequencing technologies allows us to sequence genomes, transcri...
High-throughput genomic technologies offer powerful ways to identify genetic determinants of complex...
Genetic sequencing has been recognized as an effective approach to accurately address biological pro...
High-throughput sequencing enables basic and translational biology to query the mechanics of both li...
[Background] Contaminant DNA is a well-known confounding factor in molecular biology and in genomic ...
BACKGROUND: Contaminant DNA is a well-known confounding factor in molecular biology and in genomic r...
Short-read high-throughput sequencing is the most popular approach to collect massive amount of DNA ...
DNA sample contamination is a frequent problem in DNA sequencing studies and can result in genotypin...
Sequencing has revolutionized biology by permitting the analysis of genomic variation at an unpreced...
Detecting evidence of genetic engineering in the wild is a problem of growing importance for biosecu...
Reproducibility and robustness of genomic tools are two important factors to assess the reliability ...
Computational genomics involves the development and application of computational methods for whole-g...
Background: Accurate calling of SNPs and genotypes from next-generation sequencing data is an essent...
Genetic sequencing has been recognized as an effective approach to accurately address biological pro...
Next-generation sequencing technology (NGS) enables the discovery of nearly all genetic variants pre...