In this thesis, the performance and energy efficiency of four different implementations of matrix multiplication, written in OmpSs and OpenCL, is tested and evaluated. The benchmarking is done using an Intel Ivy Bridge Core i7 3770K. The results are evaluated and discussed with regards to different optimization configurations, like vectorization and multi-threading. Energy measurements are taken using PAPI, which in turn uses the Running Average Power Limit interface in the Intel processor to take energy readings. Performance is presented using MFLOPS, while energy efficiency is compared using MFLOPS/W, watts used, and the energy delay product and energy delay squared. The OpenCL versions are compared with and without vectorization. One of ...
In this paper, we present OMPSs, a programming model based on OpenMP and StarSs, that can also incor...
Shared memory multicore processor technology is pervasive in mainstream computing. This new architec...
This paper studies the performance and energy consumption of several multi-core, multi-CPUs and many...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
The paper deals with the energy consumption evaluation of selected Sparse and Dense BLAS Level 1, 2 ...
Proceedings of: Third International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2016...
Over the past few years, energy consumption has become the main limiting factor for computing in gen...
Today’s computer systems develop towards less energy consumption while keeping high performance. The...
International audienceIn this paper, we analyse performance and energy consumption of five OpenMP ru...
The main goal of this research is to use OpenMP, Posix Threads and Microsoft Parallel Patterns libra...
This paper examines how to write code to gain high performance on modern computers as well as the im...
The proposed research goal is to introduce a new architecture for systems to increase performance an...
Excessive energy consumption has become one of the major challenges in high performance computing. R...
Abstract. Energy efficiency and power consumption have become an imperative requirement in Computer ...
In this paper, we present OMPSs, a programming model based on OpenMP and StarSs, that can also incor...
Shared memory multicore processor technology is pervasive in mainstream computing. This new architec...
This paper studies the performance and energy consumption of several multi-core, multi-CPUs and many...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
The paper deals with the energy consumption evaluation of selected Sparse and Dense BLAS Level 1, 2 ...
Proceedings of: Third International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2016...
Over the past few years, energy consumption has become the main limiting factor for computing in gen...
Today’s computer systems develop towards less energy consumption while keeping high performance. The...
International audienceIn this paper, we analyse performance and energy consumption of five OpenMP ru...
The main goal of this research is to use OpenMP, Posix Threads and Microsoft Parallel Patterns libra...
This paper examines how to write code to gain high performance on modern computers as well as the im...
The proposed research goal is to introduce a new architecture for systems to increase performance an...
Excessive energy consumption has become one of the major challenges in high performance computing. R...
Abstract. Energy efficiency and power consumption have become an imperative requirement in Computer ...
In this paper, we present OMPSs, a programming model based on OpenMP and StarSs, that can also incor...
Shared memory multicore processor technology is pervasive in mainstream computing. This new architec...
This paper studies the performance and energy consumption of several multi-core, multi-CPUs and many...