Database scanning programs such as BLAST and FASTA are used nowadays by most biologists for the post-genomic processing of DNA or protein sequence information (in particular to retrieve the structure/function of uncharacterized proteins). Unfortunately, their results can be polluted by identical alignments (called redundancies) coming from the same protein or DNA sequences present in different entries of the database. This makes the efficient use of the listed alignments difficult. Pretreatment of databases has been proposed to suppress strictly identical entries. However, there still remain many identical alignments since redundancies may occur locally for entries corresponding to various fragments of the same sequence or for entries corre...
Motivation: Comparison of nucleic acid and protein sequences is a fundamental tool of modern bioinfo...
The fact that biological sequences can be represented as strings belonging to a finite alphabet (A, ...
The increasing number of biological databases today requires that users are able to search more effi...
International audienceWith genome sequencing projects producing huge amounts of sequence data, datab...
A key concept in comparing sequence collections is the issue of redundancy. The production of sequen...
DNA sequence similarity search is an important task in computational biology applications. Similarit...
Non-redundant protein datasets are of utmost importance in bioinformatics. Constructing such dataset...
Sequence similarity in biological databases is used to characterize a newly discovered protein and c...
The ever increasing number of sequences in protein databases usually turns out large numbers of homo...
Cataloged from PDF version of article.Sequence similarity tools, such as BLAST, seek sequences most ...
Sequence similarity tools, such as BLAST, seek sequences most similar to a query from a database of ...
MOTIVATION: Sequence alignment methods that compare two sequences (pairwise methods) are important t...
Efficient and accurate search in biological sequence databases remains a matter of priority due to t...
Sequence similarity tools, such as BLAST, seek sequences most similar to a query from a database of ...
Duplicate sequence records - that is, records having similar or identical sequences - are a challeng...
Motivation: Comparison of nucleic acid and protein sequences is a fundamental tool of modern bioinfo...
The fact that biological sequences can be represented as strings belonging to a finite alphabet (A, ...
The increasing number of biological databases today requires that users are able to search more effi...
International audienceWith genome sequencing projects producing huge amounts of sequence data, datab...
A key concept in comparing sequence collections is the issue of redundancy. The production of sequen...
DNA sequence similarity search is an important task in computational biology applications. Similarit...
Non-redundant protein datasets are of utmost importance in bioinformatics. Constructing such dataset...
Sequence similarity in biological databases is used to characterize a newly discovered protein and c...
The ever increasing number of sequences in protein databases usually turns out large numbers of homo...
Cataloged from PDF version of article.Sequence similarity tools, such as BLAST, seek sequences most ...
Sequence similarity tools, such as BLAST, seek sequences most similar to a query from a database of ...
MOTIVATION: Sequence alignment methods that compare two sequences (pairwise methods) are important t...
Efficient and accurate search in biological sequence databases remains a matter of priority due to t...
Sequence similarity tools, such as BLAST, seek sequences most similar to a query from a database of ...
Duplicate sequence records - that is, records having similar or identical sequences - are a challeng...
Motivation: Comparison of nucleic acid and protein sequences is a fundamental tool of modern bioinfo...
The fact that biological sequences can be represented as strings belonging to a finite alphabet (A, ...
The increasing number of biological databases today requires that users are able to search more effi...