Executing array based applications on a chip multiprocessor requires effective loop parallelization techniques. One of the critical issues that need to be tackled by an optimizing compiler in this context is loop scheduling, which distributes the iterations of a loop to be executed in parallel across the available processors. Most of the existing work in this area targets cache based execution platforms. In comparison, this paper proposes the first dynamic loop scheduler, to our knowledge, that targets scratch-pad memory (SPM) based chip multiprocessors, and presents an experimental evaluation of it. The main idea behind our approach is to identify the set of loop iterations that access the SPM and those that do not. This information is exp...
Nowadays, many embedded processors include in their architecture on-chip static memories, so called ...
Funder: FP7 People: Marie‐Curie Actions; Id: http://dx.doi.org/10.13039/100011264; Grant(s): 327744S...
Loop scheduling has significant differences in multithreaded from other parallel processors. The sha...
In a parallel system with multiple CPUs, one of the key prob-lems is to assign loop iterations to pr...
. Dynamic loop scheduling algorithms can suffer from overheads due to synchronisation, loss of local...
Embedded systems have three common principles: real-time performance, low power consumption, and low...
Abstract—Dynamic scheduling algorithms have been success-fully used for parallel computations of nes...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...
<p>An increasing number of processor architectures support scratch-pad memory - software manag...
Exploiting parallelism in loops in programs is an important factor in realizing the potential perfor...
In this paper we explore the idea of customizing and reusing loop schedules to improve the scalabili...
In this research we propose a highly predictable, low overhead and yet dynamic, memory allocation st...
In this research we propose a highly predictable, low overhead and yet dynamic, memory allocation s...
The presence of multiple active threads on the same processor can mask latency by rapid context swit...
Previous algorithms for parallelizing loops on MIMD machines have been based on assigning one or mor...
Nowadays, many embedded processors include in their architecture on-chip static memories, so called ...
Funder: FP7 People: Marie‐Curie Actions; Id: http://dx.doi.org/10.13039/100011264; Grant(s): 327744S...
Loop scheduling has significant differences in multithreaded from other parallel processors. The sha...
In a parallel system with multiple CPUs, one of the key prob-lems is to assign loop iterations to pr...
. Dynamic loop scheduling algorithms can suffer from overheads due to synchronisation, loss of local...
Embedded systems have three common principles: real-time performance, low power consumption, and low...
Abstract—Dynamic scheduling algorithms have been success-fully used for parallel computations of nes...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...
<p>An increasing number of processor architectures support scratch-pad memory - software manag...
Exploiting parallelism in loops in programs is an important factor in realizing the potential perfor...
In this paper we explore the idea of customizing and reusing loop schedules to improve the scalabili...
In this research we propose a highly predictable, low overhead and yet dynamic, memory allocation st...
In this research we propose a highly predictable, low overhead and yet dynamic, memory allocation s...
The presence of multiple active threads on the same processor can mask latency by rapid context swit...
Previous algorithms for parallelizing loops on MIMD machines have been based on assigning one or mor...
Nowadays, many embedded processors include in their architecture on-chip static memories, so called ...
Funder: FP7 People: Marie‐Curie Actions; Id: http://dx.doi.org/10.13039/100011264; Grant(s): 327744S...
Loop scheduling has significant differences in multithreaded from other parallel processors. The sha...