The Sparc T3-4 server provides up to 512 concurrent hardware threads, a degree of concurrency that is unprecedented in a single server system. This paper reports on how the automatically parallelising compiler of the data-parallel functional array language SAC copes with up to 512 execution units. We investigate three numerical kernels that are representative of a wide range of applications: matrix multiplication, convolution and 3-dimensional FFT. We show both the high-level declarative coding style of SAC and the performance achieved on the T3-4 server. Last but not least, we draw conclusions for improving our compiler technology in the future.
This paper describes a general compiler optimization technique that reduces communication overhead f...
This paper presents an overview of the SUIF compiler, which automatically parallelizes and optimizes...
We investigate several dense bit-wise implementations of Boolean arrays in the context of the functi...
The Sparc T3-4 server provides up to 512 concurrent hardware threads, a degree of concurrency that i...
Malleable applications may run with varying numbers of threads, and thus on varying numbers of cores...
SAC (Single Assignment C) is a purely functional, data-parallel array programming language that pred...
Original article is available at: http://www.worldscinet.com Copyright World Scientific Publishing C...
We present the ins and outs of the purely functional, data parallel programming language SaC (Single...
Xeon Phi is the common brand name of Intel's Many Integrated Core (MIC) architecture. The first comm...
We present preliminary results from compiling the high-level, functional and data-parallel programmi...
Data-parallel programming facilitates elegant specification of concurrency. However, the composabili...
SAC (Single Assignment C) is a purely functional (data-parallel) array programming language [6, 2]. ...
Abstract. Data-parallel programming facilitates elegant specification of concurrency. However, the c...
Large scale parallel simulations are fundamental tools for engineers and scientists. Consequently, i...
We present the concept of an adaptive compiler optimisation framework for the functional array progr...