The enormous decrease in the cost of genomic sequencing over the past two decades has enabled researchers to revisit previously unaddressable questions in sequence analysis. However, this boom of genomic information has introduced new sets of problems that often demand computationally efficient methods. In this work, we describe computational tools for two such settings involving large-scale genomic data: 1) estimating copy number and allelic variation in two highly complex gene families, and 2) selective sequencing of a target genome in a complex DNA sample.We first describe a method that takes short reads from high-throughput sequencing and characterizes both copy number and allelic variation in the IGHV and TRBV loci. These two loci can ...
Addressing many of the major outstanding questions in the fields of microbial evolution and pathogen...
In recent years, advances in the field of sequencing technologies have enabled the field of populati...
Haplotypic sequences contain significantly more information than genotypes of genetic markers and ar...
The enormous decrease in the cost of genomic sequencing over the past two decades has enabled resear...
<div><p>The study of genomic regions that contain gene copies and structural variation is a major ch...
Characterizing genetic variation including point mutations and structural variations, is key to unde...
Clonal expansion of B lymphocytes coupled with somatic mutation and antigen selection allows the mam...
Background: Multiplex polymerase chain reaction (PCR) is a common enrichment technique for targeted ...
Copy number variation (CNV) is pervasive in the human genome and has been shown to contribute signif...
Computational genomics involves the development and application of computational methods for whole-g...
Next generation sequencing (NGS) of immunoglobulin (Ig) repertoires (Rep-seq) enables examination of...
Next generation sequencing (NGS) of immunoglobulin (Ig) repertoires (Rep-seq) enables examination of...
In this thesis, I study the problem of genome inference from short-read DNA sequencing data, with th...
The diversity of the immunological repertoire has long been a subject of research focus, providing i...
Next generation sequencing (NGS) of immunoglobulin (Ig) repertoires (Rep-seq) enables examination of...
Addressing many of the major outstanding questions in the fields of microbial evolution and pathogen...
In recent years, advances in the field of sequencing technologies have enabled the field of populati...
Haplotypic sequences contain significantly more information than genotypes of genetic markers and ar...
The enormous decrease in the cost of genomic sequencing over the past two decades has enabled resear...
<div><p>The study of genomic regions that contain gene copies and structural variation is a major ch...
Characterizing genetic variation including point mutations and structural variations, is key to unde...
Clonal expansion of B lymphocytes coupled with somatic mutation and antigen selection allows the mam...
Background: Multiplex polymerase chain reaction (PCR) is a common enrichment technique for targeted ...
Copy number variation (CNV) is pervasive in the human genome and has been shown to contribute signif...
Computational genomics involves the development and application of computational methods for whole-g...
Next generation sequencing (NGS) of immunoglobulin (Ig) repertoires (Rep-seq) enables examination of...
Next generation sequencing (NGS) of immunoglobulin (Ig) repertoires (Rep-seq) enables examination of...
In this thesis, I study the problem of genome inference from short-read DNA sequencing data, with th...
The diversity of the immunological repertoire has long been a subject of research focus, providing i...
Next generation sequencing (NGS) of immunoglobulin (Ig) repertoires (Rep-seq) enables examination of...
Addressing many of the major outstanding questions in the fields of microbial evolution and pathogen...
In recent years, advances in the field of sequencing technologies have enabled the field of populati...
Haplotypic sequences contain significantly more information than genotypes of genetic markers and ar...