Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally performed via statistical phasing in a genotyped cohort, an approach that can yield high accuracy in very large cohorts but attains lower accuracy in smaller cohorts. Here we instead explore the paradigm of reference-based phasing. We introduce a new phasing algorithm, Eagle2, that attains high accuracy across a broad range of cohort sizes by efficiently leveraging information from large external reference panels (such as the Haplotype Reference Consortium; HRC) using a new data structure based on the positional Burrows-Wheeler transform. We demonstrate that Eagle2 attains a ∼20× speedup and ∼10% increase in accuracy compared to reference-bas...
Motivation: There is growing recognition that estimating haplotypes from high coverage sequencing of...
High-throughput sequencing technologies produce short sequence reads that can contain phase informat...
We describe a reference panel of 64,976 human haplotypes at 39,235,157 SNPs constructed using whole-...
Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally ...
The 1000 Genomes Project and disease-specific sequencing efforts are producing large collections of ...
The 1000 Genomes Project and disease-specific sequencing efforts are producing large collections of ...
The genome-wide association study (GWAS) has become a routine approach for mapping disease risk loci...
The valuable information in correct order of alleles on the haplotypes has many applications in GWAS...
The valuable information in correct order of alleles on the haplotypes has many applications in GWAS...
BACKGROUND: Knowledge of phase, the specific allele sequence on each copy of homologous chromosomes,...
A haplotype is the sequence of nucleotides along a single chromosome. As humans, we have 23 pairs of...
Genotype imputation is the process of predicting unobserved genotypes in a sample of individuals usi...
BACKGROUND: Haplotype reconstruction (phasing) is an essential step in many applications, including ...
High-throughput sequencing technologies produce short sequence reads that can contain phase informat...
High-throughput sequencing technologies produce short sequence reads that can contain phase informat...
Motivation: There is growing recognition that estimating haplotypes from high coverage sequencing of...
High-throughput sequencing technologies produce short sequence reads that can contain phase informat...
We describe a reference panel of 64,976 human haplotypes at 39,235,157 SNPs constructed using whole-...
Haplotype phasing is a fundamental problem in medical and population genetics. Phasing is generally ...
The 1000 Genomes Project and disease-specific sequencing efforts are producing large collections of ...
The 1000 Genomes Project and disease-specific sequencing efforts are producing large collections of ...
The genome-wide association study (GWAS) has become a routine approach for mapping disease risk loci...
The valuable information in correct order of alleles on the haplotypes has many applications in GWAS...
The valuable information in correct order of alleles on the haplotypes has many applications in GWAS...
BACKGROUND: Knowledge of phase, the specific allele sequence on each copy of homologous chromosomes,...
A haplotype is the sequence of nucleotides along a single chromosome. As humans, we have 23 pairs of...
Genotype imputation is the process of predicting unobserved genotypes in a sample of individuals usi...
BACKGROUND: Haplotype reconstruction (phasing) is an essential step in many applications, including ...
High-throughput sequencing technologies produce short sequence reads that can contain phase informat...
High-throughput sequencing technologies produce short sequence reads that can contain phase informat...
Motivation: There is growing recognition that estimating haplotypes from high coverage sequencing of...
High-throughput sequencing technologies produce short sequence reads that can contain phase informat...
We describe a reference panel of 64,976 human haplotypes at 39,235,157 SNPs constructed using whole-...