Motivation: Pangenome graphs representing aligned genome assemblies are being shared in the text-based Graphical Fragment Assembly format. As the number of assemblies grows, there is a need for a file format that can store the highly repetitive data space-efficiently. Results: We propose the GBZ file format based on data structures used in the Giraffe short read aligner. The format provides good compression, and the files can be efficiently loaded into in-memory data structures. We provide compression and decompression tools and libraries for using GBZ graphs, and we show that they can be efficiently used on a variety of systems. Availability: C++ and Rust implementations are available at https://github.com/jltsiren/gbwtgraph and https://githu...
The amount of sequence data has increased exponentially during the last decade. This applies especia...
Next generation sequencers produce billions of short DNA sequences in a massively parallel manner, w...
Background: Next Generation Sequencing (NGS) has dramatically enhanced our abili...
MOTIVATION: Pangenome graphs representing aligned genome assemblies are being shared in the text-bas...
A Giraffe and a GFA-formatted minigraph assembly of sixteen bread wheat cultivar genome assemblies. ...
Low-cost whole-genome assembly has enabled the collection of haplotype-resolved pangenomes for numer...
PGGB builds pangenome variation graphs from a set of input sequences. A pangenome variation graph c...
Pangenomes of multiple species for the PGGB paper. Each pangenome is represented in a FASTA format ...
Abstract: We introduce Giraffe, a pangenome short read mapper that can efficiently map to a collectio...
GFAKluge is a set of command line utilities and a C++ library for parsing and manipulating the Graph...
Holley G, Wittler R, Stoye J, Hach F. Dynamic Alignment-Free and Reference-Free Read Compression. JO...
Long reads and Hi-C have revolutionized the field of genome assembly as they h...
Pangenome references address biases of reference genomes by storing a representative set of diverse ...
Herein, we uploaded the compressed file titled "Primate_Genome_Annotation_GFF_Files.zip". In this zi...
gfaestus is a pangenome graph browser for GFA files. It reads graphs in GFAv1 format, with 2D layout...