Abstract. An emerging class of architectures are accelerator-based het-erogeneous multiprocessors with software-managed memory hierarchies like Cell/BE. A major difficulty in programming such kind of machines are the explicit data transfers between the different memories raising new programming challenges. In this paper we discuss a programming approach which supports application programmers in writing efficient code for non-cache-based architectures. A crucial role plays the interplay between the three parties programmer, parallelization framework, and native compiler. Based on our experiences with past programming ap-proaches, we propose language extensions to orchestrate parallel execu-tion of threads and to control data transfers. Exper...
The Cell BE processor provides both scalable computation power and flexibility, and it is already be...
Recent efforts in microprocessor development tend to the coexistence of several Central Pro-cessing ...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
We address programming of accelerator-based heterogeneous multiprocessors in the context of computat...
This paper describes the implementation of a runtime library for asynchronous communication in the C...
We present MPOpt-Cell, an architecture-aware framework for high-productivity development and efficie...
Abstract. This paper describes the implementation of a runtime library for asynchronous communicatio...
The Cell BE processor provides both scalable computation power and flexibility, and it is already be...
While accelerators have become more prevalent in recent years, they are still considered hard to pro...
Single-core architectures have hit the end of the road and industry and academia are currently explo...
Before it can achieve wide acceptance,parallel computation must be made significantly easier to prog...
Ease of programming is one of the main impediments for the broad acceptance of multi-core systems wi...
Parallel architectures are the way of the future, but are notoriously difficult to program. In addit...
This paper describes the approaches taken to extend an array programming language compiler using a ...
The heterogeneous design of multi-core processors, such as the Cell processor, introduced new challe...
The Cell BE processor provides both scalable computation power and flexibility, and it is already be...
Recent efforts in microprocessor development tend to the coexistence of several Central Pro-cessing ...
As the demand increases for high performance and power efficiency in modern computer runtime systems...
We address programming of accelerator-based heterogeneous multiprocessors in the context of computat...
This paper describes the implementation of a runtime library for asynchronous communication in the C...
We present MPOpt-Cell, an architecture-aware framework for high-productivity development and efficie...
Abstract. This paper describes the implementation of a runtime library for asynchronous communicatio...
The Cell BE processor provides both scalable computation power and flexibility, and it is already be...
While accelerators have become more prevalent in recent years, they are still considered hard to pro...
Single-core architectures have hit the end of the road and industry and academia are currently explo...
Before it can achieve wide acceptance,parallel computation must be made significantly easier to prog...
Ease of programming is one of the main impediments for the broad acceptance of multi-core systems wi...
Parallel architectures are the way of the future, but are notoriously difficult to program. In addit...
This paper describes the approaches taken to extend an array programming language compiler using a ...
The heterogeneous design of multi-core processors, such as the Cell processor, introduced new challe...
The Cell BE processor provides both scalable computation power and flexibility, and it is already be...
Recent efforts in microprocessor development tend to the coexistence of several Central Pro-cessing ...
As the demand increases for high performance and power efficiency in modern computer runtime systems...