This paper investigates the development of a molecular dynamics code that is highly portable between architectures. Using OpenCL, we develop an implementation of Sandia’s miniMD benchmark that achieves good levels of performance across a wide range of hardware: CPUs, discrete GPUs and integrated GPUs. We demonstrate that the performance bottlenecks of miniMD’s short-range force calculation kernel are the same across these architectures, and detail a number of platform-agnostic optimisations that improve its performance by at least 2x on all hardware considered. Our complete code is shown to be 1.7x faster than the original miniMD, and at most 2x slower than implementations individually hand-tuned for a specific architecture.
The acceleration of molecular dynamics (MD) simulations using high-performance reconfigurable comput...
Molecular dynamics (MD) is a powerful computer simulation technique providing atomistic resolution a...
The OpenCL standard allows targeting a large variety of CPU, GPU and accelerator architectures using...
This paper reports on the development of an MPI/OpenCL implementation of LU, an application-level be...
This work describes my solution to the performance portability problem: between CPUs and GPUs in par...
Classical molecular dynamics simulation for atomistic systems is implemented in OpenCL and benchmark...
Molecular Dynamics (MD) is a widely used tool for simulations of particle systems with pair-wise int...
Accelerator processors allow energy-efficient computation at high performance, especially for comput...
The gap between a supercomputer's theoretical maximum ("peak") floating-point performance and that a...
The architecture of high performance computing systems is becoming more and more heterogeneo...
The architecture of high performance computing systems is becoming more and more heteroge...
Thesis (Ph.D.)--Boston University PLEASE NOTE: Boston University Libraries did not receive an Aut...
Significantly increasing intra-node parallelism is widely recognised as being a key prerequisite for...
Copyright: © 2015 Materials Research Society. This article discusses novel algorithms for molecular-dy...
2012-04-27. The shift to many-core architecture design paradigm in computer market has provided unprec...