A high performance Matrix-Matrix Multiplication Methodology for CPU and GPU architectures

Kelefouras, Vasilios
Kritikakou, Angeliki
Mporas, Iosif
Kolonias, Vasilios

Publication date

January 2016

DOI

10.1007/s11227-015-1613-7

Publisher

Springer Science and Business Media LLC

Abstract

Current compilers cannot generate code that can compete with hand-tuned code in efficiency, even for a simple kernel like matrix–matrix multiplication (MMM). A key step in program optimization is the estimation of optimal values for parameters such as tile sizes and number of levels of tiling. The scheduling parameter values selection is a very difficult and time-consuming task, since parameter values depend on each other; this is why they are found by using searching methods and empirical techniques. To overcome this problem, the scheduling sub-problems must be optimized together, as one problem and not separately. In this paper, an MMM methodology is presented where the optimum scheduling parameters are found by decr...
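
For readers unfamiliar with the tile-size parameter the abstract mentions, the following is a minimal sketch of one level of loop tiling applied to MMM. It is not the paper's methodology: the function names `matmul_tiled` and `matmul_naive`, the square-matrix restriction, and the single `tile` parameter are illustrative assumptions, and plain Python is used only for readability, not performance.

```python
def matmul_naive(A, B, n):
    """Reference triple-loop product of two n x n matrices (lists of lists)."""
    C = [[0] * n for _ in range(n)]
    for i in range(n):
        for k in range(n):
            for j in range(n):
                C[i][j] += A[i][k] * B[k][j]
    return C


def matmul_tiled(A, B, n, tile):
    """Same product with one level of tiling.

    `tile` is a hypothetical scheduling parameter: the i, k and j loops are
    each split into blocks of at most `tile` iterations, so that each block
    of A, B and C is reused while it is (ideally) still in cache.
    """
    C = [[0] * n for _ in range(n)]
    for ii in range(0, n, tile):              # block row of C and A
        for kk in range(0, n, tile):          # block column of A, block row of B
            for jj in range(0, n, tile):      # block column of C and B
                # min(...) handles edge blocks when tile does not divide n
                for i in range(ii, min(ii + tile, n)):
                    for k in range(kk, min(kk + tile, n)):
                        a = A[i][k]
                        for j in range(jj, min(jj + tile, n)):
                            C[i][j] += a * B[k][j]
    return C
```

Any positive `tile` gives the same result as the untiled loop; only the memory access order changes. Choosing that value (and how many tiling levels to nest) for a given memory hierarchy is the parameter-selection problem the abstract describes.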
