The introduction of massively parallel short-read sequencing has facilitated rapidly dropping costs of DNA sequencing. This has led to substantial growth in the size of human sequencing projects, with consortia of low coverage sequencing data containing tens of thousands of samples. However, current statistical methods for genotype calling from this data scale poorly with sample size, and are infeasible to use on the largest of current projects. This thesis explores the problem of genotype calling and phasing of large sample sizes of low-coverage sequencing data. Current methods are applied to call and phase genotypes of the CONVERGE consortium, a data set consisting of very low coverage next-generation sequencing data collected from around...
High-throughput sequencing technologies produce short sequence reads that can contain phase informat...
A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide associa...
Additional file 1: Figure S1. Expected haplotype imputation accuracy against the accumulated haploty...
The introduction of massively parallel short-read sequencing has facilitated rapidly dropping costs ...
Low-coverage whole genome sequencing followed by imputation has been proposed as a cost-effective ge...
Abstract Background This paper describes a heuristic method for allocating low-coverage sequencing r...
MOTIVATION: Given the current costs of next-generation sequencing, large studies carry out low-cover...
A major use of the 1000 Genomes Project (1000GP) data is genotype imputation in genome-wide associat...
<div><p>High coverage whole genome sequencing provides near complete information about genetic varia...
textabstractImputing genotypes from reference panels created by whole-genome sequencing (WGS) provid...
High coverage whole genome sequencing provides near complete information about genetic variation. Ho...
Genotype microarrays assay hundreds of thousands of genetic variants on an individual's genome. The ...
Imputing genotypes from reference panels created by whole-genome sequencing (WGS) provides a cost-ef...
Next-generation sequencing is revolutionising in genetics, where base-by base information for the wh...
Inexpensive genotyping methods are essential for genetic studies requiring large sample sizes. In hu...
High-throughput sequencing technologies produce short sequence reads that can contain phase informat...
A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide associa...
Additional file 1: Figure S1. Expected haplotype imputation accuracy against the accumulated haploty...
The introduction of massively parallel short-read sequencing has facilitated rapidly dropping costs ...
Low-coverage whole genome sequencing followed by imputation has been proposed as a cost-effective ge...
Abstract Background This paper describes a heuristic method for allocating low-coverage sequencing r...
MOTIVATION: Given the current costs of next-generation sequencing, large studies carry out low-cover...
A major use of the 1000 Genomes Project (1000GP) data is genotype imputation in genome-wide associat...
<div><p>High coverage whole genome sequencing provides near complete information about genetic varia...
textabstractImputing genotypes from reference panels created by whole-genome sequencing (WGS) provid...
High coverage whole genome sequencing provides near complete information about genetic variation. Ho...
Genotype microarrays assay hundreds of thousands of genetic variants on an individual's genome. The ...
Imputing genotypes from reference panels created by whole-genome sequencing (WGS) provides a cost-ef...
Next-generation sequencing is revolutionising in genetics, where base-by base information for the wh...
Inexpensive genotyping methods are essential for genetic studies requiring large sample sizes. In hu...
High-throughput sequencing technologies produce short sequence reads that can contain phase informat...
A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide associa...
Additional file 1: Figure S1. Expected haplotype imputation accuracy against the accumulated haploty...