A Cache-centric Execution Model and Runtime for Deep Parallel Multicore Topologies

Pericas, Miquel

Publication date

January 2016

DOI

10.1145/2967938.2974052

Publisher

Association for Computing Machinery (ACM)

Abstract

Computational task DAGs are executed on parallel computers by a task scheduling algorithm. Intelligent scheduling is critical for achieving high parallelism, low overheads and reduced communication. A key technique for load balancing task DAGs is work stealing (WS), which Blumofe et al. popularized for fork-join computations [2]. In scenarios of high parallel slackness, WS's distributed nature allows it to scale to a large number of cores with low overhead [4]. However, the space of a WS computation grows proportionally to the number of cores. Targeting a lower bound, Blelloch et al. proposed the parallel-depth-first (PDF) scheduler [1]. PDF schedules tasks by following the depth-first (serial) order of computation and has space requirem...