Adaptations of the Aho-Corasick (AC) algorithm to high-performance graphics processing units (GPUs) have garnered increasing attention in recent years. However, no results have been reported for implementations on mobile GPUs. In this paper, we show that implementing a state-of-the-art parallel Aho-Corasick algorithm on a mobile GPU delivers significant speedups. We study a few implementation optimizations, some of which may seem counter-intuitive compared with standard optimizations for high-end GPUs. More importantly, we focus on measuring the energy consumed by the different components of the OpenCL application rather than reporting only the aggregate. We show that there are considerable energy savings compared to the CPU implementation ...
It is commonplace for graphics processing units or GPUs today to render extremely complex 3D scenes ...
Pattern discovery is one of the fundamental tasks in bioinformatics and pattern recognition is a pow...
In this work, we examine the performance and energy efficiency when using Python for developing HPC ...
Graphics processing units (GPUs) have attracted a lot of attention due to their cost-eff...
Consumers of personal devices such as desktops, tablets, or smart phones run applications b...
Despite the fact that the GPU was originally intended as a co-processor specializing in graphics r...
We develop GPU adaptations of the Aho-Corasick string matching algorithm for the two cases ...
Pattern matching algorithms are the basis of information biology as well as information retrie...
Modern mobile devices are often required to process multiple computationally intensive applications ...
In this paper, we parallelize and optimize the popular feature detection algorithms, i.e. S...
The basic and standard multiple patterns string matching algorithm is Aho-Corasick invented by Alfre...
Energy optimization is an increasingly important aspect of today's high-performance computing applic...
Heterogeneous Multi-Processor Systems-on-Chips (MPSoCs) containing CPU and GPU cores are typically r...
We develop GPU adaptations of the Aho-Corasick and multipattern Boyer-Moore string matching...
In this work, we evaluate OpenCL as a programming tool for developing performance-portable applicati...