There is a trend towards using accelerators to increase performance and energy efficiency of general-purpose processors. Adoption of accelerators, however, depends on the availability of tools to facilitate programming these devices. In this paper, we present. techniques for automatically partitioning programs for execution on accelerators. We call the off-loaded code regions sub-algorithms, which are parts of the program that are loosely connected to the remainder of the program. We present three heuristics for automatically identifying sub-algorithms based oil control flow and data flow properties. Analysis of SPECint and MiBench benchmarks shows that, on average 12 sub-algorithms are identified (up to 54), covering the full execution tim...
Re configurable FPGA/CPU systems are widely described in literature as a viable processing solution ...
The demand for high performance has driven acyclic computation accelerators into extensive use in mo...
This work studies programmability enhancing abstractions in the context of accelerators and heteroge...
There is a trend towards using accelerators to increase performance and energy efficiency of general...
In the heterogeneous computing execution model, one or more general-purpose processors are accelerat...
The StreamIt programming model has been proposed to exploit parallelism in streaming applications ...
Abstract — Modern processors have a small on-chip local memory for instructions. Usually it is in th...
This paper presents a new partitioning method for software oriented hardware /software codesign. It ...
The efficient solution of a large problem on a small systolic array requires good partitioning techn...
grantor: University of TorontoHigh performance can be obtained on field-programmable custo...
The integration of microprocessors and field-programmable gate array (FPGA) fabric on a single chip ...
Stream based languages are a popular approach to expressing parallelism in modern applications. The ...
On current multiprocessor architectures one must carefully distribute data in memory in order to ach...
Hardware accelerators are an energy efficient alternative to general purpose processors for specific...
Abstract. SIMD hardware accelerators offer an alternative to manycores when energy consumption and p...
Re configurable FPGA/CPU systems are widely described in literature as a viable processing solution ...
The demand for high performance has driven acyclic computation accelerators into extensive use in mo...
This work studies programmability enhancing abstractions in the context of accelerators and heteroge...
There is a trend towards using accelerators to increase performance and energy efficiency of general...
In the heterogeneous computing execution model, one or more general-purpose processors are accelerat...
The StreamIt programming model has been proposed to exploit parallelism in streaming applications ...
Abstract — Modern processors have a small on-chip local memory for instructions. Usually it is in th...
This paper presents a new partitioning method for software oriented hardware /software codesign. It ...
The efficient solution of a large problem on a small systolic array requires good partitioning techn...
grantor: University of TorontoHigh performance can be obtained on field-programmable custo...
The integration of microprocessors and field-programmable gate array (FPGA) fabric on a single chip ...
Stream based languages are a popular approach to expressing parallelism in modern applications. The ...
On current multiprocessor architectures one must carefully distribute data in memory in order to ach...
Hardware accelerators are an energy efficient alternative to general purpose processors for specific...
Abstract. SIMD hardware accelerators offer an alternative to manycores when energy consumption and p...
Re configurable FPGA/CPU systems are widely described in literature as a viable processing solution ...
The demand for high performance has driven acyclic computation accelerators into extensive use in mo...
This work studies programmability enhancing abstractions in the context of accelerators and heteroge...