Background: One of the most frequent uses of bioinformatics tools is the functional characterization of a newly produced nucleotide sequence (the query sequence) by running BLAST or FASTA against a set of sequences (the subject sequences). In some contexts, however, it is useful to compare the query sequence against a cluster of sequences such as a MultiAlignment (MA). We present the RegExpBlasting (REB) algorithm, which compares an unclassified sequence with a dataset of patterns defined by applying Regular Expression rules to an input dataset of MAs. The REB workflow consists of (i) the definition of a dataset of multialignments, (ii) the association of each MA with a pattern, ...
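The idea of associating an MA with a pattern and then testing a query against a dataset of such patterns can be illustrated with a minimal sketch. This is not the REB implementation: the column-wise rule used here (one character class per alignment column, made optional when the column contains a gap) is an assumption chosen for illustration, and the names `pattern_from_alignment`, `matching_patterns`, and `toy_MA` are hypothetical.

```python
import re


def pattern_from_alignment(rows):
    """Build one regular expression from equal-length aligned rows.

    Assumed rule (illustrative only, not taken from the REB paper):
    each column becomes a character class of the residues seen in it,
    and the class is optional if any row has a gap ('-') in that column.
    """
    if len({len(row) for row in rows}) != 1:
        raise ValueError("aligned rows must have equal length")
    parts = []
    for column in zip(*rows):
        residues = sorted(set(column) - {"-"})
        if not residues:
            continue  # skip columns that contain only gaps
        if len(residues) == 1:
            token = re.escape(residues[0])
        else:
            token = "[" + "".join(re.escape(r) for r in residues) + "]"
        if "-" in column:
            token += "?"  # a gap in the column makes the position optional
        parts.append(token)
    return re.compile("".join(parts))


def matching_patterns(query, patterns):
    """Return the names of all patterns found anywhere in the query."""
    return [name for name, pattern in patterns.items() if pattern.search(query)]


# Toy multialignment of three nucleotide sequences (hypothetical data).
alignment = ["ACGT-A", "ACCTGA", "ACGTGA"]
patterns = {"toy_MA": pattern_from_alignment(alignment)}
hits = matching_patterns("TTACGTAGG", patterns)
```

In this sketch each MA contributes a single compiled pattern, so classifying a query amounts to one regular-expression search per MA rather than one pairwise comparison per subject sequence.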
Abstract. The presence of long gaps dramatically increases the difficulty of detecting and characte...
While regexp matching is a powerful mechanism for finding patterns in data streams, regexp engines i...
In the biological sciences, sequence analysis refers to analytical investigations that use nucleic a...
RE-MuSiC is a web-based multiple sequence alignment tool that can incorporate biological...
String pattern matching is an extensively studied area of computer science. Over the past few decade...
Background: Biologists are often interested i...
This thesis presents an application of a generalized suffix tree extended by the use of frequency of...
In previous work [10], we considered algorithms related to the statistics of matches with words and...
Imposing constraints is an effective means to incorporate biological knowledge into alig...
The identification of interesting patterns (or subsequences) in biosequences has an important role i...
We define a novel variation on the constrained sequence alignment problem in w...
Background: Motif analysis methods have long been central for studying biological function o...
GenericBioMatch is a novel algorithm for exact match in biological sequences. It allows the sequence...