Abstract—We present an inter-architectural comparison of single- and double-precision direct n-body implementations on modern multicore platforms, including those based on the Intel Nehalem and AMD Barcelona systems, the Sony-Toshiba-IBM PowerXCell/8i processor, and NVIDIA Tesla C870 and C1060 GPU systems. We compare our implementations across platforms on a variety of proxy measures, including performance, coding complexity, and energy efficiency.
Abstract. We have implemented a fast collisionless N-body code which runs on GPU, the peak performance...
Abstract. Recently, hybrid architectures using accelerators like GP-GPUs or the Cell processor have ...
In this work, we evaluate the performance of a real-world image processing application that uses a cross...
Abstract—N-body simulations are computation-intensive applications that calculate the motion of a l...
With the emergence of general-purpose computation on graphics processing units, high-level approache...
This paper studies the performance and energy consumption of several multi-core, multi-CPUs and many...
Direct-summation N-body algorithms compute the gravitational interaction between stars in an exact w...
Algorithms designed to efficiently solve this classical problem of physics fit very well on GPU hard...
The main performance bottleneck of gravitational N-body codes is the force calculation between two p...
We present the results of gravitational direct N-body simulations using the graphics processing unit...
Over the last decade, graphics processing units have improved significantly in pe...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...