The current human reference sequence (GRCh38) is a foundation for large-scale sequencing projects. However, recent studies have suggested that GRCh38 may be incomplete and give a suboptimal representation of specific population groups. Here, we performed a de novo assembly of two Swedish genomes that revealed over 10 Mb of sequences absent from the human GRCh38 reference in each individual. Around 6 Mb of these novel sequences (NS) are shared with a Chinese personal genome. The NS are highly repetitive, have an elevated GC-content, and are primarily located in centromeric or telomeric regions. Up to 1 Mb of NS can be assigned to chromosome Y, and large segments are also missing from GRCh38 at chromosomes 14, 17, and 21. Inclusion of NS into...
Here we use whole-genome de novo assembly of second-generation sequencing reads to map structural va...
Compared to its predecessors, the Telomere-to-Telomere CHM13 genome adds nearly 200 million base pai...
Elucidating the full spectrum of genetic variations across the human population is a fundamental pur...
The current human reference sequence (GRCh38) is a foundation for large-scale sequencing projects. H...
The human reference genome assembly plays a central role in nearly all aspects of today's basic and ...
The human genome reference (HGR) completion marked the genomics era beginning, yet despite its utili...
Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and...
The human reference genome is used extensively in modern biological research. However, a single cons...
The 1000 Genomes Project produced more than 100 trillion basepairs of short read sequence from more ...
Novel sequences (NSs), not present in the human reference genome, are abundant and remain largely un...
Over the last two decades the advancement in DNA sequencing technologies has enormously increased th...
The human genome is arguably the most complete mammalian reference assembly, yet more than 160 euchr...
A 2.91-billion base pair (bp) consensus sequence of the euchromatic portion of the human genome was ...
Background: The non-reference sequences (NRS) represent structure variations in human genome with po...
Here we integrate the de novo assembly of an Asian and an African genome with the NCBI reference hum...
Here we use whole-genome de novo assembly of second-generation sequencing reads to map structural va...
Compared to its predecessors, the Telomere-to-Telomere CHM13 genome adds nearly 200 million base pai...
Elucidating the full spectrum of genetic variations across the human population is a fundamental pur...
The current human reference sequence (GRCh38) is a foundation for large-scale sequencing projects. H...
The human reference genome assembly plays a central role in nearly all aspects of today's basic and ...
The human genome reference (HGR) completion marked the genomics era beginning, yet despite its utili...
Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and...
The human reference genome is used extensively in modern biological research. However, a single cons...
The 1000 Genomes Project produced more than 100 trillion basepairs of short read sequence from more ...
Novel sequences (NSs), not present in the human reference genome, are abundant and remain largely un...
Over the last two decades the advancement in DNA sequencing technologies has enormously increased th...
The human genome is arguably the most complete mammalian reference assembly, yet more than 160 euchr...
A 2.91-billion base pair (bp) consensus sequence of the euchromatic portion of the human genome was ...
Background: The non-reference sequences (NRS) represent structure variations in human genome with po...
Here we integrate the de novo assembly of an Asian and an African genome with the NCBI reference hum...
Here we use whole-genome de novo assembly of second-generation sequencing reads to map structural va...
Compared to its predecessors, the Telomere-to-Telomere CHM13 genome adds nearly 200 million base pai...
Elucidating the full spectrum of genetic variations across the human population is a fundamental pur...