Background: All-versus-all BLAST, which searches for homologous pairs of sequences in a database of proteins, is used to identify potential orthologs, to find new protein families, and to provide rapid access to these homology relationships. As DNA sequencing accelerates and data sets grow, all-versus-all BLAST has become computationally demanding. Methodology/principal findings: We present FastBLAST, a heuristic replacement for all-versus-all BLAST that relies on alignments of proteins to known families, obtained from tools such as PSI-BLAST and HMMer. FastBLAST avoids most of the work of all-versus-all BLAST by taking advantage of these alignments and by clustering similar sequences. FastBLAST runs in two stages: the first stage identifies add...
Sequence alignment is a long-standing problem in bioinformatics. The Basic Local Alignment Search To...
Orthology inference and other sequence analyses across multiple genomes typically start by performin...
As DNA sequencing accelerates and the sequence databases grow, many problems in sequence analysis ar...
Background: In the bioinformatics community, many tasks are associated with matching a set of protein...
Metagenome sequencing efforts have provided a large pool of billions of genes for identifying enzyme...
With genome sequencing projects producing huge amounts of sequence data, datab...
Background: Fueled by rapid progress in high-throughput sequencing, the size of public sequence datab...
Molecular biologists, geneticists, and other life scientists use the BLAST homology search package a...
Bioinformatics was brought into the spotlight in the late 1990s through the Human Genome Project. Wi...
Motivation: Over the last decades, vast numbers of sequences were deposited in public databases. Bio...
One of the fundamental challenges in computational biology is the identification of evolutionarily r...