StVEC: A Vector Instruction Extension for High Performance Stencil Computation

Naser Sedaghati
Renji Thomas
Louis-Noël Pouchet
Radu Teodorescu

Publication date

October 2015

Abstract

Stencil computations comprise the compute-intensive core of many scientific applications. The data access pattern of stencil computations often requires several adjacent data elements of arrays to be accessed in innermost parallel loops. Although such loops are vectorized by current compilers like GCC and ICC that target short-vector SIMD instruction sets, a number of redundant loads or additional intra-register data shuffle operations are required, reducing the achievable performance. Thus, even when all arrays are cache resident, the peak performance achieved with stencil computations is considerably lower than machine peak. In this paper, we present a hardware-based solution for this problem. We propose an extension to the stand...
