International audiencePosition Weight Matrices (PWMs) are broadly used in computational biology. The basic problems, Scan and MultipleScan, aim to find all the occurrences of a given PWM or a set of PWMs in long sequences. Some other PWM tasks share a common NP-hard subproblem, ScoreDistribution. The existing algorithms rely on the enumeration on a large set of scores or words, and they are mostly not suitable for parallelization. We propose a new algorithm, BucketScoreDistribution, that is both very efficient and suitable for parallelization. We bound the error induced by this algorithm. We realized a GPU prototype for Scan, MultipleScan and BucketScoreDistribution with the CUDA libraries, and report for the different problems speedups lar...
Graphics processor a b s t r a c t Finding regions of similarity between two very long data streams ...
The idea of using a graphics processing unit (GPU) for more than simply graphic output purposes has ...
Background: The huge quantity of data produced in Biomedical research needs sophisticated algorithmi...
International audiencePosition Weight Matrices (PWMs) are broadly used in computational biology. The...
Background The identification of all matches of a large set of position weight matrices (PWMs) in lo...
International audienceText matching with errors is a regular task in computational biology. We prese...
Detecting similarities between (RNA, DNA, and protein) sequences is an important part of bioinformat...
One of the most ambitious trends in current biomedical research is the large-scale genomic sequencin...
Massively parallel DNA sequencing technologies have revolutionized genomics and molecular biology by...
The exponential growth of databases that contains biological information (such as protein and DNA da...
Background: Position Weight Matrices (PWMs) are probabilistic representations of signals in sequence...
The increasing amount of sequences stored in genomic databases has become unfeasible to the sequenti...
Abstract—This paper describes a multi-threaded parallel design and implementation of the Smith-Water...
Background: Tandem mass spectrometry-based database searching is currently the main method for prote...
Markov clustering is becoming a key algorithm within bioinformatics for determining clusters in netw...
Graphics processor a b s t r a c t Finding regions of similarity between two very long data streams ...
The idea of using a graphics processing unit (GPU) for more than simply graphic output purposes has ...
Background: The huge quantity of data produced in Biomedical research needs sophisticated algorithmi...
International audiencePosition Weight Matrices (PWMs) are broadly used in computational biology. The...
Background The identification of all matches of a large set of position weight matrices (PWMs) in lo...
International audienceText matching with errors is a regular task in computational biology. We prese...
Detecting similarities between (RNA, DNA, and protein) sequences is an important part of bioinformat...
One of the most ambitious trends in current biomedical research is the large-scale genomic sequencin...
Massively parallel DNA sequencing technologies have revolutionized genomics and molecular biology by...
The exponential growth of databases that contains biological information (such as protein and DNA da...
Background: Position Weight Matrices (PWMs) are probabilistic representations of signals in sequence...
The increasing amount of sequences stored in genomic databases has become unfeasible to the sequenti...
Abstract—This paper describes a multi-threaded parallel design and implementation of the Smith-Water...
Background: Tandem mass spectrometry-based database searching is currently the main method for prote...
Markov clustering is becoming a key algorithm within bioinformatics for determining clusters in netw...
Graphics processor a b s t r a c t Finding regions of similarity between two very long data streams ...
The idea of using a graphics processing unit (GPU) for more than simply graphic output purposes has ...
Background: The huge quantity of data produced in Biomedical research needs sophisticated algorithmi...