We address the problem of estimating the sensitivity of seed-based similarity search algorithms. In contrast to approaches based on Markov models [18, 6, 3, 4, 10], we study the estimation based on homogeneous alignments. We describe an algorithm for counting and random generation of those alignments and an algorithm for exact computation of the sensitivity for a broad class of seed strategies. We provide experimental results demonstrating a bias introduced by ignoring the homogeneousness condition
The challenge of similarity search in massive DNA sequence databases has inspired major changes in B...
We apply the concept of subset seeds to similarity search in protein sequences. The main question st...
We apply the concept of subset seeds proposed in [1] to similarity search in protein sequences. The ...
We address the problem of estimating the sensitivity of seed-based similarity search algorithms. In ...
We address the problem of measuring the sensitivity of seed-based similarity search algorithms. In c...
We propose a general approach to compute the seed sensitivity, that can be applied to different def...
We propose a general approach to compute the seed sensitivity, that can be applied to different defi...
We propose a general approach to compute the seed sensitivity, that can be applied to different defi...
International audienceWe propose a general approach to compute the seed sensitivity, that can be app...
Spaced seeds are a fundamental tool for similarity search in biosequences. The best sensitivity/sele...
Motivation: Standard search techniques for DNA repeats start by identifying small matching words, or...
Large-scale comparison of genomic DNA is of fundamental importance in annotating functional elements...
AbstractGenomics studies routinely depend on similarity searches based on the strategy of finding sh...
AbstractLarge-scale comparison of genomic DNA is of fundamental importance in annotating functional ...
AbstractThe novel introduction of spaced seed idea in the filtration stage of sequence comparison by...
The challenge of similarity search in massive DNA sequence databases has inspired major changes in B...
We apply the concept of subset seeds to similarity search in protein sequences. The main question st...
We apply the concept of subset seeds proposed in [1] to similarity search in protein sequences. The ...
We address the problem of estimating the sensitivity of seed-based similarity search algorithms. In ...
We address the problem of measuring the sensitivity of seed-based similarity search algorithms. In c...
We propose a general approach to compute the seed sensitivity, that can be applied to different def...
We propose a general approach to compute the seed sensitivity, that can be applied to different defi...
We propose a general approach to compute the seed sensitivity, that can be applied to different defi...
International audienceWe propose a general approach to compute the seed sensitivity, that can be app...
Spaced seeds are a fundamental tool for similarity search in biosequences. The best sensitivity/sele...
Motivation: Standard search techniques for DNA repeats start by identifying small matching words, or...
Large-scale comparison of genomic DNA is of fundamental importance in annotating functional elements...
AbstractGenomics studies routinely depend on similarity searches based on the strategy of finding sh...
AbstractLarge-scale comparison of genomic DNA is of fundamental importance in annotating functional ...
AbstractThe novel introduction of spaced seed idea in the filtration stage of sequence comparison by...
The challenge of similarity search in massive DNA sequence databases has inspired major changes in B...
We apply the concept of subset seeds to similarity search in protein sequences. The main question st...
We apply the concept of subset seeds proposed in [1] to similarity search in protein sequences. The ...