We apply the concept of subset seeds to similarity search in protein sequences. The main question studied is the design of efficient seed alphabets to construct seeds with optimal sensitivity/selectivity trade-offs. We propose several different design methods and use them to construct several alphabets. We then perform a comparative analysis of seeds built over those alphabets and compare them with the standard Blastp seeding method, as well as with the family of vector seeds. While the formalism of subset seeds is less expressive (but less costly to implement) than the cumulative principle used in Blastp and vector seeds, our seeds show a similar or even better performance than Blastp on Bernoulli models of proteins compatible with the com...
Abstract—We present a framework for improving local protein alignment algorithms. Specifically, we d...
Abstract—Sequence similarity search is a common and re-peated task in molecular biology. The rapid g...
Homology search finds similar segments between two biological sequences, such as DNA or protein sequ...
We apply the concept of subset seeds to similarity search in protein sequences. The main question st...
We apply the concept of subset seeds proposed in [1] to similarity search in protein sequences. The ...
International audienceWe apply the concept of subset seeds proposed in [1] to similarity search in p...
International audienceWe apply the concept of subset seeds proposed in [1] to similarity search in p...
International audienceWe apply the concept of subset seeds proposed in [1] to similarity search in p...
International audienceWe apply the concept of subset seeds proposed in [1] to similarity search in p...
International audienceWe apply the concept of subset seeds proposed in [1] to similarity search in p...
International audienceThe seeding technique became central in the theory of sequence alignment and t...
International audienceThe seeding technique became central in the theory of sequence alignment and t...
International audienceThe seeding technique became central in the theory of sequence alignment and t...
International audienceThe seeding technique became central in the theory of sequence alignment and t...
International audienceThe seeding technique became central in the theory of sequence alignment and t...
Abstract—We present a framework for improving local protein alignment algorithms. Specifically, we d...
Abstract—Sequence similarity search is a common and re-peated task in molecular biology. The rapid g...
Homology search finds similar segments between two biological sequences, such as DNA or protein sequ...
We apply the concept of subset seeds to similarity search in protein sequences. The main question st...
We apply the concept of subset seeds proposed in [1] to similarity search in protein sequences. The ...
International audienceWe apply the concept of subset seeds proposed in [1] to similarity search in p...
International audienceWe apply the concept of subset seeds proposed in [1] to similarity search in p...
International audienceWe apply the concept of subset seeds proposed in [1] to similarity search in p...
International audienceWe apply the concept of subset seeds proposed in [1] to similarity search in p...
International audienceWe apply the concept of subset seeds proposed in [1] to similarity search in p...
International audienceThe seeding technique became central in the theory of sequence alignment and t...
International audienceThe seeding technique became central in the theory of sequence alignment and t...
International audienceThe seeding technique became central in the theory of sequence alignment and t...
International audienceThe seeding technique became central in the theory of sequence alignment and t...
International audienceThe seeding technique became central in the theory of sequence alignment and t...
Abstract—We present a framework for improving local protein alignment algorithms. Specifically, we d...
Abstract—Sequence similarity search is a common and re-peated task in molecular biology. The rapid g...
Homology search finds similar segments between two biological sequences, such as DNA or protein sequ...