Abstract. Moore’s Law suggests that the number of processing cores on a single chip increases exponentially. Future performance increases will be extracted mainly from the thread-level parallelism exploited by multi/many-core processors (MCP). Therefore, it is necessary to find out how to build MCP hardware and how to program the parallelism on such MCPs. In this work, we intend to identify the key architecture mechanisms and software optimizations to guarantee high performance for multithreaded programs. To illustrate this, we customize a dense matrix multiplication algorithm on the Godson-T MCP as a case study to demonstrate the efficient synergy and interaction between hardware and software. Experiments conducted on the cycle-accurat...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...
In this project I optimized the Dense Matrix-Matrix multiplication calculation by tiling the matrice...
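The tiling optimization mentioned in this project abstract is not spelled out in the truncated text; purely as a generic illustration, the sketch below shows a cache-blocked (tiled) dense matrix multiplication in C. The tile size TILE, the row-major layout, and the function name matmul_tiled are assumptions made for the example, not details taken from the project itself.

    #include <stdio.h>
    #include <stdlib.h>

    #define TILE 64  /* assumed tile size; in practice tuned to the cache hierarchy */

    /* C += A * B for n x n matrices stored in row-major order.
     * The three outer loops walk over tiles so that the working set
     * (one tile each of A, B and C) stays resident in cache. */
    static void matmul_tiled(size_t n, const double *A, const double *B, double *C)
    {
        for (size_t ii = 0; ii < n; ii += TILE)
            for (size_t kk = 0; kk < n; kk += TILE)
                for (size_t jj = 0; jj < n; jj += TILE)
                    for (size_t i = ii; i < ii + TILE && i < n; i++)
                        for (size_t k = kk; k < kk + TILE && k < n; k++) {
                            double a = A[i * n + k];
                            for (size_t j = jj; j < jj + TILE && j < n; j++)
                                C[i * n + j] += a * B[k * n + j];
                        }
    }

    int main(void)
    {
        size_t n = 256;
        double *A = malloc(n * n * sizeof *A);
        double *B = malloc(n * n * sizeof *B);
        double *C = calloc(n * n, sizeof *C);
        for (size_t i = 0; i < n * n; i++) { A[i] = 1.0; B[i] = 2.0; }
        matmul_tiled(n, A, B, C);
        printf("C[0][0] = %f (expected %f)\n", C[0], 2.0 * n);
        free(A); free(B); free(C);
        return 0;
    }

The i-k-j loop order keeps the innermost accesses to B and C unit-stride, which, together with the blocking, is the usual reason such tiled row-major kernels outperform the naive triple loop.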
Abstract: Few realize that, for large matrices, many dense matrix computations achieve nearly the sa...
Abstract. Traditional parallel programming methodologies for improv-ing performance assume cache-bas...
Out-of-core implementations of algorithms for dense matrix computations have traditionally focused o...
Abstract. In this article, we present a fast algorithm for matrix multiplication optimized for recent ...
While the growing number of cores per chip allows researchers to solve larger scientific and enginee...
This Master's thesis examines whether a matrix multiplication program that combines the two efficiency stra...
Many-Task Computing (MTC) is a common scenario for multiple parallel systems, such as clusters, grids...
Until recently, performance gains in processors were achieved largely by improvements in clock speed...
A number of parallel formulations of the dense matrix multiplication algorithm have been developed. For ...
This paper discusses different types of algorithms for matrix multiplication when applied to paral...
Abstract. This paper presents a study of performance optimization of dense matrix multiplication on ...
As users and developers, we are witnessing the opening of a new computing scenario: the introduction...
Sparse matrix-matrix multiplication (SpGEMM) is a computational primitive that is widely used in are...
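The truncated abstract above only names SpGEMM as a primitive; purely as background, the following is a minimal sketch of one textbook formulation (Gustavson's row-by-row algorithm over CSR operands) in C. The csr_t layout, its field names, and the on-demand growth of the output arrays are illustrative assumptions and are not taken from the cited work.

    #include <stdlib.h>

    /* Compressed Sparse Row storage: row_ptr has nrows + 1 entries;
     * col_idx/val hold the nonzeros of each row contiguously. */
    typedef struct {
        int nrows, ncols, nnz;
        int *row_ptr, *col_idx;
        double *val;
    } csr_t;

    /* C = A * B via Gustavson's row-wise algorithm: each row of C is formed
     * by scaling and merging the rows of B selected by the nonzeros of the
     * corresponding row of A.  mark[j] remembers where column j of the current
     * output row lives in C, so repeated products are accumulated in place.
     * Column indices within each output row are left unsorted. */
    static csr_t spgemm(const csr_t *A, const csr_t *B)
    {
        csr_t C = { A->nrows, B->ncols, 0, NULL, NULL, NULL };
        int cap = A->nnz + B->nnz + 1;                 /* initial capacity guess */
        C.row_ptr = malloc((C.nrows + 1) * sizeof *C.row_ptr);
        C.col_idx = malloc(cap * sizeof *C.col_idx);
        C.val     = malloc(cap * sizeof *C.val);

        int *mark = malloc(B->ncols * sizeof *mark);
        for (int j = 0; j < B->ncols; j++) mark[j] = -1;

        C.row_ptr[0] = 0;
        for (int i = 0; i < A->nrows; i++) {
            int row_start = C.nnz;
            for (int p = A->row_ptr[i]; p < A->row_ptr[i + 1]; p++) {
                int k = A->col_idx[p];
                double a = A->val[p];
                for (int q = B->row_ptr[k]; q < B->row_ptr[k + 1]; q++) {
                    int j = B->col_idx[q];
                    if (mark[j] < row_start) {         /* first contribution to C(i,j) */
                        if (C.nnz == cap) {            /* grow the output on demand */
                            cap *= 2;
                            C.col_idx = realloc(C.col_idx, cap * sizeof *C.col_idx);
                            C.val     = realloc(C.val, cap * sizeof *C.val);
                        }
                        mark[j] = C.nnz;
                        C.col_idx[C.nnz] = j;
                        C.val[C.nnz] = a * B->val[q];
                        C.nnz++;
                    } else {                           /* accumulate duplicate product */
                        C.val[mark[j]] += a * B->val[q];
                    }
                }
            }
            C.row_ptr[i + 1] = C.nnz;
        }
        free(mark);
        return C;
    }

Real SpGEMM implementations differ mainly in the accumulator (dense array, hash map, or heap) and in how they size the output, but the row-by-row structure above is the common starting point.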