The Texas Instruments C66x Digital Signal Processor (DSP) is an embedded processor technology that is targeted at real time signal processing. It is also developed with a high potential to become the new generation of coprocessor technology for high performance embedded computing. Of particular interest is its performance for stencil computations, such as those found in signal processing and computer vision tasks. A stencil is a loop in which the output value is updated at each position of an array by taking a weighted function of its neighbors. Efficiently mapping stencil-based kernels to the C66x device presents two challenges. The first one is how to efficiently optimize loops in order to facilitate the usage of Single Instruction Multip...
We are witnessing a fundamental paradigm shift in computer design. Memory has been and is becoming m...
Although stencil auto-tuning has shown tremendous potential in effectively utilizing architectural r...
Communicated by Guest Editors Our aim is to apply program transformations to stencil codes in order ...
The Texas Instruments C66x Digital Signal Processor (DSP) is an embedded processor technology that i...
Embedded system-on-chip processors such as the Texas Instruments C66 DSP and the IBM Cell provide th...
We are witnessing a fundamental paradigm shift in computer design. Memory has been and is becoming m...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
Stencil computations are an integral component of applications in a number of scientific computing d...
A widely used class of codes are stencil codes. Their general structure is very simple: data points ...
Stencil computations are a key class of applications, widely used in the scientific computing commun...
A large number of algorithms for multidimensional signals processing and scientific computation come...
dissertationStencil computations are operations on structured grids. They are frequently found in pa...
A large number of algorithms for multidimensional signals processing and scientific computation come...
In this paper, we present Patus, a code generation and auto-tuning framework for stencil computation...
Abstract Performance optimization of stencil computations has beenwidely studied in the literature, ...
We are witnessing a fundamental paradigm shift in computer design. Memory has been and is becoming m...
Although stencil auto-tuning has shown tremendous potential in effectively utilizing architectural r...
Communicated by Guest Editors Our aim is to apply program transformations to stencil codes in order ...
The Texas Instruments C66x Digital Signal Processor (DSP) is an embedded processor technology that i...
Embedded system-on-chip processors such as the Texas Instruments C66 DSP and the IBM Cell provide th...
We are witnessing a fundamental paradigm shift in computer design. Memory has been and is becoming m...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
Stencil computations are an integral component of applications in a number of scientific computing d...
A widely used class of codes are stencil codes. Their general structure is very simple: data points ...
Stencil computations are a key class of applications, widely used in the scientific computing commun...
A large number of algorithms for multidimensional signals processing and scientific computation come...
dissertationStencil computations are operations on structured grids. They are frequently found in pa...
A large number of algorithms for multidimensional signals processing and scientific computation come...
In this paper, we present Patus, a code generation and auto-tuning framework for stencil computation...
Abstract Performance optimization of stencil computations has beenwidely studied in the literature, ...
We are witnessing a fundamental paradigm shift in computer design. Memory has been and is becoming m...
Although stencil auto-tuning has shown tremendous potential in effectively utilizing architectural r...
Communicated by Guest Editors Our aim is to apply program transformations to stencil codes in order ...