In this paper we present MP-Queue, a flexible and efficient queue-based communication library for MPSoCs. Our library is suitable for a wide range of hardware platforms and its configuration space is explored across a wide number of dimensions. We introduce an upper-bound evaluation metric to compare the efficiency of the library against an ideal point-to-point data transfer. We can thus quantitatively assess the overhead introduced by the synchronization protocol and by shared bus contention. We discuss source-level optimizations introduced in the library that enable aggressive compiler optimizations, without compromising code portability. A significant speedup is achieved w.r.t a non-optimized library (15% for small-size messages), while ...
This paper presents a portable optimization for MPI communications, called PRAcTICaL-MPI (Portable A...
Efficient programming of multimedia streaming applications for Consumer Electronics (CE) devices is ...
[[abstract]]Dual-core processors (and, to an extent, multicore processors) have been adopted in rece...
In this paper we present MP-Queue, a flexible and efficient queue-based communication library for MP...
Kelly W, Flasskamp M, Sievers G, et al. A Communication Model and Partitioning Algorithm for Streami...
This thesis presents the design and implementation of a Chip-Multiprocessor (CMP) targeted at stream...
Streaming applications are an important class of applications in emerging embedded systems such as s...
Energy efficient embedded computing enables new application scenarios in mobile devices like softwar...
International audienceStreaming applications, such as packet switching or video and multimedia proce...
Server performance is a crucial factor for many distributed, clientserver based multimedia applicati...
Among the various paradigms of parallelization, pipeline parallelism has the advantage of maintainin...
Embedded streaming applications specified using parallel Models of Computation (MoC) often contain a...
We present MPWide, a light weight communication library which allows efficient message passing over ...
In this paper, we jointly optimize computation and communication task scheduling for streaming appli...
This thesis addresses the problem of designing performance and energy efficient embedded streaming s...
This paper presents a portable optimization for MPI communications, called PRAcTICaL-MPI (Portable A...
Efficient programming of multimedia streaming applications for Consumer Electronics (CE) devices is ...
[[abstract]]Dual-core processors (and, to an extent, multicore processors) have been adopted in rece...
In this paper we present MP-Queue, a flexible and efficient queue-based communication library for MP...
Kelly W, Flasskamp M, Sievers G, et al. A Communication Model and Partitioning Algorithm for Streami...
This thesis presents the design and implementation of a Chip-Multiprocessor (CMP) targeted at stream...
Streaming applications are an important class of applications in emerging embedded systems such as s...
Energy efficient embedded computing enables new application scenarios in mobile devices like softwar...
International audienceStreaming applications, such as packet switching or video and multimedia proce...
Server performance is a crucial factor for many distributed, clientserver based multimedia applicati...
Among the various paradigms of parallelization, pipeline parallelism has the advantage of maintainin...
Embedded streaming applications specified using parallel Models of Computation (MoC) often contain a...
We present MPWide, a light weight communication library which allows efficient message passing over ...
In this paper, we jointly optimize computation and communication task scheduling for streaming appli...
This thesis addresses the problem of designing performance and energy efficient embedded streaming s...
This paper presents a portable optimization for MPI communications, called PRAcTICaL-MPI (Portable A...
Efficient programming of multimedia streaming applications for Consumer Electronics (CE) devices is ...
[[abstract]]Dual-core processors (and, to an extent, multicore processors) have been adopted in rece...