A wide variety of computer architectures have been proposed to exploit parallelism at different granularities. These architectures have significant differences in instruction scheduling constraints, memory latencies, and synchronization overhead, making it difficult to determine which architecture can achieve the best performance on a given program. Trace-driven simulations and analytic models are used to compare the instruction-level parallelism of a superscalar processor and a pipelined processor with the loop-level parallelism of a shared memory multiprocessor. It is shown that the maximum speedup for a loop with a cyclic dependence graph is limited by its critical dependence ratio, independent of the number of iterations in the loop. Th...
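As a reading aid, here is a minimal sketch of the bound this abstract states, assuming the standard recurrence formulation from the software-pipelining literature; the symbols R, l(C), d(C), and W below are our notation, not necessarily the paper's. For each cycle C in the loop's dependence graph, let l(C) be the total operation latency around the cycle and d(C) the total dependence distance (the number of iterations the cycle spans). The critical dependence ratio is then

    R = \max_{C \in \text{cycles}} \frac{l(C)}{d(C)},

so a new iteration can start at most once every R time units, no matter how many processors or functional units are available. If W is the sequential work of one iteration, the achievable speedup is bounded by

    \text{speedup} \le \frac{W}{R},

which is independent of the number of iterations executed, matching the claim above.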
The potential for higher performance from increasing on-chip transistor densities, on the one hand, ...
Cache coherence is one of the main challenges to tackle when designing a shared-memory multiprocesso...
Although it is convenient to program large-scale multiprocessors as though all processors shared acc...
200 p. Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1993. The use of a private cache in...
Reducing memory latency is critical to the performance of large-scale parallel systems. Due to the t...
Although improved device technology has increased the performance of computer systems, fundamental h...
Due to VLSI lithography problems and the limitations of additional architectural enhancements, uniproc...
Interest in multitasked multiprocessor systems is motivated by the necessity to increase throughput ...
This dissertation explores techniques for reducing the costs of inter-processor communication i...
Shared-memory multiprocessors built from commodity microprocessors are increasingly being used to pr...
The last decade has produced enormous improvements in processor speeds without a corresponding impro...
Data used by parallel programs can be divided into classes, based on how threads access it. For di...
In this paper, we study a hardware-supported, compiler-directed (HSCD) cache coherence scheme, which...
Multiprocessors with shared memory are considered more general and easier to program than message-pa...