Abstract—N-body simulations are computation-intensive applications that calculate the motion of a large number of bodies under pair-wise forces. Although different versions of n-body codes have been widely used in many scientific fields, the performance and energy efficiency of various n-body codes have not been comprehensively studied, especially when they are running on newly released multi-core CPUs and GPUs (e.g., Tesla K20). In this paper, we evaluate the performance and energy efficiency of five parallel n-body implementations on two different multi-core CPU systems and on two different types of GPUs. Our experimental results show that up to 71% of the energy can be saved by using all cores of a Xeon E5620 CPU instead of only one....
Abstract—We present an inter-architectural comparison of single- and double-precision direct n-body ...
N-body problems, such as simulating the motion of stars in a galaxy and evaluating the spatial stati...
Using two full applications with different characteristics, this thesis explores the performance and...
Algorithms designed to efficiently solve this classical problem of physics fit very well on GPU hard...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
With the emergence of general-purpose computation on graphics processing units, high-level approache...
N-Body simulation simulates the evolution of a system that is composed of N particles, where each el...
Hybrid computational architectures based on the joint power of Central Processing Units (CPUs) and G...
Increasing heterogeneity among HPC platforms requires applications to be frequently ported and tuned...
N-Body simulation algorithms are amongst the most commonly used within the field of scientific compu...
We compare the performance of two very different parallel gravitational N-body codes for astrophysic...