Excessive energy consumption has become one of the major challenges in high performance computing. Reducing the energy consumption of frequently used high performance computing applications not only saves the energy cost but also reduces the greenhouse gas emissions. This paper focuses on developing energy efficient algorithms and software for the widely used matrix-matrix multiplication, so that it is able to consume less energy in a DVFS-enabled cluster with little sacrifice in performance. The state-of-the-art practical parallel matrix matrix multiplication algorithm in ScaLAPACK partitions matrices into small blocks and distributes matrices using a two dimensional block cyclic distribution approach. Experimental results demonstrate that...
Matrix multiplication is one of the important operations in scientific and engineering application. ...
In this paper, we present a new load balancing technique, called panel scattering, which is generall...
Boosting performance and energy efficiency of scientific applications running on high performance co...
Excessive energy consumption has become one of the major challenges in high performance computing. R...
AbstractThe demands of improving energy efficiency for high performance scientific applications aris...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
This paper describes a novel parallel algorithm that implements a dense matrix multiplication operat...
Matrix-matrix multiplication is one of the core computations in many algorithms from scientific comp...
The multiplication of a vector by a matrix is the kernel operation in many algorithms used in scient...
Matrix multiplication is one of the important operations in scientific and engineering application. ...
Abstract—Energy efficiency has emerged as one of the key performance metrics in computing. In this w...
International audienceGPU matrix chain multiplication serves as a basis for a wide range of scientif...
The demands of improving energy efficiency for high performance scientific applications arise crucia...
In this thesis, the performance and energy efficiency of four different implementations of matrix mu...
Matrix multiplication is one of the important operations in scientific and engineering application. ...
In this paper, we present a new load balancing technique, called panel scattering, which is generall...
Boosting performance and energy efficiency of scientific applications running on high performance co...
Excessive energy consumption has become one of the major challenges in high performance computing. R...
AbstractThe demands of improving energy efficiency for high performance scientific applications aris...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
This paper describes a novel parallel algorithm that implements a dense matrix multiplication operat...
Matrix-matrix multiplication is one of the core computations in many algorithms from scientific comp...
The multiplication of a vector by a matrix is the kernel operation in many algorithms used in scient...
Matrix multiplication is one of the important operations in scientific and engineering application. ...
Abstract—Energy efficiency has emerged as one of the key performance metrics in computing. In this w...
International audienceGPU matrix chain multiplication serves as a basis for a wide range of scientif...
The demands of improving energy efficiency for high performance scientific applications arise crucia...
In this thesis, the performance and energy efficiency of four different implementations of matrix mu...
Matrix multiplication is one of the important operations in scientific and engineering application. ...
In this paper, we present a new load balancing technique, called panel scattering, which is generall...
Boosting performance and energy efficiency of scientific applications running on high performance co...