Abstract — In modern multiprocessor systems, proces-sors can be stalled by inter-task communication when read-ing from a remote buffer. This paper presents a solution for the inter-task communication, that has a minimal im-pact on the performance of the system, hides the inter-task communication latency without requiring additional hard-ware. The solution applies to jobs, represented as task graphs, where the tasks are nested loop programs. Buffers are allocated in scratch-pad memories of the consuming tasks to provide low latency read access. For the nested loop programs, minimal buffer sizes can be determined to cover all possible communication patterns. The added computational complexity is low, as the solution adds only a few operations...
Communication and synchronization stand as the dual bottlenecks in the performance of parallel syste...
Shared-memory multiprocessors are becoming increasingly popular as a high-performance, easy to progr...
Interprocessor communication has been one of the major overheads in parallel processing. Although sy...
In modern multiprocessor systems, processors can be stalled by inter-task communication when reading...
Multimedia applications, executed by embedded multiprocessor systems, can in some cases be represent...
[[abstract]]Intensive scientific algorithms can usually be formulated as nested loops which are the ...
Nested loops are normally the most time intensive tasks in computer algorithms. These loops often in...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...
Communication overhead in multiprocessor systems, as exemplified by cache coherency traffic and glob...
Multimedia applications process streams of values and can often be represented as task graphs. For p...
227 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1988.Most future supercomputers wi...
Although, computer system architecture and the throughput enhances continuously, the need for high c...
Application specific MPSoCs are often used to implement high-performance data-intensive applications...
We present buffered coscheduling, a new methodology to multitask parallel jobs in a message-passing ...
Consider a scheduling problem of parallel computations in multiprocessor systems. Let a parallel pro...
Communication and synchronization stand as the dual bottlenecks in the performance of parallel syste...
Shared-memory multiprocessors are becoming increasingly popular as a high-performance, easy to progr...
Interprocessor communication has been one of the major overheads in parallel processing. Although sy...
In modern multiprocessor systems, processors can be stalled by inter-task communication when reading...
Multimedia applications, executed by embedded multiprocessor systems, can in some cases be represent...
[[abstract]]Intensive scientific algorithms can usually be formulated as nested loops which are the ...
Nested loops are normally the most time intensive tasks in computer algorithms. These loops often in...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...
Communication overhead in multiprocessor systems, as exemplified by cache coherency traffic and glob...
Multimedia applications process streams of values and can often be represented as task graphs. For p...
227 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1988.Most future supercomputers wi...
Although, computer system architecture and the throughput enhances continuously, the need for high c...
Application specific MPSoCs are often used to implement high-performance data-intensive applications...
We present buffered coscheduling, a new methodology to multitask parallel jobs in a message-passing ...
Consider a scheduling problem of parallel computations in multiprocessor systems. Let a parallel pro...
Communication and synchronization stand as the dual bottlenecks in the performance of parallel syste...
Shared-memory multiprocessors are becoming increasingly popular as a high-performance, easy to progr...
Interprocessor communication has been one of the major overheads in parallel processing. Although sy...