In modern multiprocessor systems, processors can be stalled by inter-task communication when reading from a remote buffer. This paper presents a solution for the inter-task communication, that has a minimal impact on the performance of the system, hides the inter-task communication latency without requiring additional hardware. The solution applies to jobs, represented as task graphs, where the tasks are nested loop programs. Buffers are allocated in scratch-pad memories of the consuming tasks to provide low latency read access. For the nested loop programs, minimal buffer sizes can be determined to cover all possible communication patterns. The added computational complexity is low, as the solution adds only a few operations to the nested ...
Communication and synchronization stand as the dual bottlenecks in the performance of parallel syste...
Consider a scheduling problem of parallel computations in multiprocessor systems. Let a parallel pro...
[[abstract]]Several loop-buffering techniques were proposed for reducing power consumption of embedd...
Abstract — In modern multiprocessor systems, proces-sors can be stalled by inter-task communication ...
Multimedia applications, executed by embedded multiprocessor systems, can in some cases be represent...
[[abstract]]Intensive scientific algorithms can usually be formulated as nested loops which are the ...
Nested loops are normally the most time intensive tasks in computer algorithms. These loops often in...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...
Multimedia applications process streams of values and can often be represented as task graphs. For p...
Although, computer system architecture and the throughput enhances continuously, the need for high c...
Application specific MPSoCs are often used to implement high-performance data-intensive applications...
Communication overhead in multiprocessor systems, as exemplified by cache coherency traffic and glob...
227 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1988.Most future supercomputers wi...
Consider a scheduling problem of parallel computations in multiprocessor systems. Let a parallel pro...
Interprocessor communication has been one of the major overheads in parallel processing. Although sy...
Communication and synchronization stand as the dual bottlenecks in the performance of parallel syste...
Consider a scheduling problem of parallel computations in multiprocessor systems. Let a parallel pro...
[[abstract]]Several loop-buffering techniques were proposed for reducing power consumption of embedd...
Abstract — In modern multiprocessor systems, proces-sors can be stalled by inter-task communication ...
Multimedia applications, executed by embedded multiprocessor systems, can in some cases be represent...
[[abstract]]Intensive scientific algorithms can usually be formulated as nested loops which are the ...
Nested loops are normally the most time intensive tasks in computer algorithms. These loops often in...
Link to published version: http://ieeexplore.ieee.org/iel2/390/6075/00236705.pdf?tp=&arnumber=236705...
Multimedia applications process streams of values and can often be represented as task graphs. For p...
Although, computer system architecture and the throughput enhances continuously, the need for high c...
Application specific MPSoCs are often used to implement high-performance data-intensive applications...
Communication overhead in multiprocessor systems, as exemplified by cache coherency traffic and glob...
227 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1988.Most future supercomputers wi...
Consider a scheduling problem of parallel computations in multiprocessor systems. Let a parallel pro...
Interprocessor communication has been one of the major overheads in parallel processing. Although sy...
Communication and synchronization stand as the dual bottlenecks in the performance of parallel syste...
Consider a scheduling problem of parallel computations in multiprocessor systems. Let a parallel pro...
[[abstract]]Several loop-buffering techniques were proposed for reducing power consumption of embedd...