Loop Transformations for NUMA Machines

Wei Li
Keshav Pingali

Publication date

January 1993

DOI

10.1145/156668.156674

ISSN

0362-1340

Abstract

In this paper, we describe a framework for loop transformations and code generation for NUMA (non-uniform memory access) machines. Most scalable parallel machines can be classified as NUMA machines because a processor can access data in its local memory ten to a thousand times faster than it can access non-local data. In addition, when a processor must make a number of accesses to data residing at a remote processor, it is usually more efficient to use block transfers of data rather than to use many small messages. Furthermore, each processor usually has a data cache. A system for programming these machines must tackle the following challenges: (1) expose and exploit parallelism in programs, (2) manage data to avoid making non-local accesses, ...
