The largest gaps in the human genome assembly correspond to multi-megabase heterochromatic regions composed primarily of two related families of tandem repeats, Human Satellites 2 and 3 (HSat2,3). The abundance of repetitive DNA in these regions challenges standard mapping and assembly algorithms, and as a result, the sequence composition and potential biological functions of these regions remain largely unexplored. Furthermore, existing genomic tools designed to predict consensus-based descriptions of repeat families cannot be readily applied to complex satellite repeats such as HSat2,3, which lack a consistent repeat unit reference sequence. Here we present an alignment-free method to characterize complex satellites using whole-genome sho...
The human genome reference (HGR) completion marked the genomics era beginning, yet despite its utili...
A 2.91-billion base pair (bp) consensus sequence of the euchromatic portion of the human genome was ...
Motivation: The advent of Next Generation Sequencing (NGS) has led to the generation of enormous vol...
The largest gaps in the human genome assembly correspond to multi-megabase heterochromatic regions c...
<div><p>The largest gaps in the human genome assembly correspond to multi-megabase heterochromatic r...
Centromeric alpha satellite (AS) is composed of highly identical higher-order DNA repetitive sequenc...
Highly-repetitive satellite DNA (satDNA) repeats are found in most eukaryotic genomes. SatDNAs are r...
The human genome is arguably the most complete mammalian reference assembly, yet more than 160 euchr...
The central goal of medical genomics is to understand the inherited basis of sequence variation that...
Satellite DNA has been identified in varying proportions in many eukaryotic genomes. It consists of ...
In the latest hg38 human genome assembly, centromeric gaps has been filled in by alpha satellite (AS...
Thesis (Ph.D.)--University of Washington, 2021Despite their importance in disease and evolution, hig...
Large structural variants (SVs) in the human genome are difficult to detect and study by conventiona...
Tens of millions of base pairs of euchromatic human genome sequence, including many protein-coding g...
The human genome is arguably the most complete mammalian reference assembly, yet more than 160 euchr...
The human genome reference (HGR) completion marked the genomics era beginning, yet despite its utili...
A 2.91-billion base pair (bp) consensus sequence of the euchromatic portion of the human genome was ...
Motivation: The advent of Next Generation Sequencing (NGS) has led to the generation of enormous vol...
The largest gaps in the human genome assembly correspond to multi-megabase heterochromatic regions c...
<div><p>The largest gaps in the human genome assembly correspond to multi-megabase heterochromatic r...
Centromeric alpha satellite (AS) is composed of highly identical higher-order DNA repetitive sequenc...
Highly-repetitive satellite DNA (satDNA) repeats are found in most eukaryotic genomes. SatDNAs are r...
The human genome is arguably the most complete mammalian reference assembly, yet more than 160 euchr...
The central goal of medical genomics is to understand the inherited basis of sequence variation that...
Satellite DNA has been identified in varying proportions in many eukaryotic genomes. It consists of ...
In the latest hg38 human genome assembly, centromeric gaps has been filled in by alpha satellite (AS...
Thesis (Ph.D.)--University of Washington, 2021Despite their importance in disease and evolution, hig...
Large structural variants (SVs) in the human genome are difficult to detect and study by conventiona...
Tens of millions of base pairs of euchromatic human genome sequence, including many protein-coding g...
The human genome is arguably the most complete mammalian reference assembly, yet more than 160 euchr...
The human genome reference (HGR) completion marked the genomics era beginning, yet despite its utili...
A 2.91-billion base pair (bp) consensus sequence of the euchromatic portion of the human genome was ...
Motivation: The advent of Next Generation Sequencing (NGS) has led to the generation of enormous vol...