We present VOBLA, a domain-specific language designed for programming linear algebra libraries. VOBLA is compiled to PENCIL, a domain-independent intermediate language designed for efficient mapping to accelerator architectures such as GPGPUs. PENCIL is compiled to efficient, platform-specific OpenCL code using techniques based on the polyhedral model. This approach addresses both the programmer productivity and performance portability concerns associated with accelerator programming. We demonstrate our approach by using VOBLA to implement a BLAS library. We have evaluated the performance of OpenCL code generated using our compilation flow on ARM Mali, AMD Radeon, and AMD Opteron platforms. The generated code is current...
Clifford Algebra (CA) is a powerful mathematical language that allows for a simple and intuitive rep...
Current processor architectures are diverse and heterogeneous. Examples include multicore chips, GPU...
Programming accelerators such as GPUs with low-level APIs and languages such as OpenCL and CUDA is d...
© 2015 IEEE. Programming accelerators such as GPUs with low-level APIs and languages such as OpenCL an...
The trend of using co-processors as accelerators to perform certain tasks is rising in the parallel...
GPUs have been successfully used for acceleration of many mathematical functions and librar...
We have invested heavily in hardware development but software tools and methods to use the hardware ...
In this work, we evaluate OpenCL as a programming tool for developing performance-portable applicati...
This paper introduces TIRAMISU, a polyhedral framework designed to generate high performance code fo...
High performance parallel computing was something exclusive for expensive specialized hardware some ...
Parallel accelerators are playing an increasingly important role in scientific computing. However, i...
massively parallel processor, has been widely used in general-purpose computing tasks. Although there...
While many linear algebra libraries have been developed to optimize their performance, no linear alg...
OpenCL has been proposed as a means of accelerating functional computation using FPGA and GPU accele...