DNA sequences are translated into protein coding sequences and then further assigned to protein families in metagenomic analyses, because of the need for sensitivity. However, huge amounts of sequence data create the problem that even general homology search analyses using BLASTX become difficult in terms of computational cost. We designed a new homology search algorithm that finds seed sequences based on the suffix arrays of a query and a database, and have implemented it as GHOSTX. GHOSTX achieved approximately 131-165 times acceleration over a BLASTX search at similar levels of sensitivity. GHOSTX is distributed under the BSD 2-clause license and is available for download at http://www.bi.cs.titech.ac.jp/ghostx/. Currently, sequencing te...
With the increasing amount of DNA sequence information deposited in our databases searching for simi...
Data generated by metagenomic and metatranscriptomic experiments is both enormous and inherently noi...
With the increasing amount of DNA sequence information deposited in our databases searching for simi...
<div><p>DNA sequences are translated into protein coding sequences and then further assigned to prot...
A large number of sensitive homology searches are required for mapping DNA sequence fragments to kno...
Sequence similarity searches have been widely used in the analyses of metagenomic sequencing data. F...
Protein homology search is an important, yet time-consuming, step in everything from protein annotat...
A new method for homology search of DNA sequences is suggested. This method may be used to find exte...
Motivation: Homology search finds similar segments between two biological sequences, such as DNA or ...
Bioinformatics was brought into the spotlight in the late 1990s through the Human Genome Project. Wi...
Motivation: Many bioinformatic approaches exist for find-ing novel genes within genomic sequence dat...
Abstract Background Homology is a key concept in both evolutionary biology and genomics. Detection o...
Efficient and accurate search in biological sequence databases remains a matter of priority due to t...
Biology researchers have a pressing need for data management technologies which will make the storag...
Background/Objectives: In the biological sequences, palindromes can create structures that differ fr...
With the increasing amount of DNA sequence information deposited in our databases searching for simi...
Data generated by metagenomic and metatranscriptomic experiments is both enormous and inherently noi...
With the increasing amount of DNA sequence information deposited in our databases searching for simi...
<div><p>DNA sequences are translated into protein coding sequences and then further assigned to prot...
A large number of sensitive homology searches are required for mapping DNA sequence fragments to kno...
Sequence similarity searches have been widely used in the analyses of metagenomic sequencing data. F...
Protein homology search is an important, yet time-consuming, step in everything from protein annotat...
A new method for homology search of DNA sequences is suggested. This method may be used to find exte...
Motivation: Homology search finds similar segments between two biological sequences, such as DNA or ...
Bioinformatics was brought into the spotlight in the late 1990s through the Human Genome Project. Wi...
Motivation: Many bioinformatic approaches exist for find-ing novel genes within genomic sequence dat...
Abstract Background Homology is a key concept in both evolutionary biology and genomics. Detection o...
Efficient and accurate search in biological sequence databases remains a matter of priority due to t...
Biology researchers have a pressing need for data management technologies which will make the storag...
Background/Objectives: In the biological sequences, palindromes can create structures that differ fr...
With the increasing amount of DNA sequence information deposited in our databases searching for simi...
Data generated by metagenomic and metatranscriptomic experiments is both enormous and inherently noi...
With the increasing amount of DNA sequence information deposited in our databases searching for simi...