The construction of neighbour lists based on the linked cell method is investigated in the context of particle simulation methods within the OpenMP shared memory programming model. Various implementations are studied which avoid memory collisions and race conditions. Performance and optimisation considerations are made along with run time behaviour and memory requirements. Performance models are proposed, which reproduce the measured runtime behaviour and which provide insight into the performance dependence on specific system parameters. Benchmarks are performed for different implementations on a number of multi-core architectures and thread numbers up to 240 are considered on the Xeon Phi architecture in the SMT mode, so that performance ...
Circuit-level simulation is a computationally intensive problem that has proven to be particularly d...
The Configuration Interaction (CI) method has been widely used to solve the non-relativistic many-bo...
MCBEND is a general purpose radiation transport Monte Carlo code from AMEC Foster Wheelers’s ANSWERS...
This paper presents a set of proposals for the OpenMP shared--memory programming model oriented tow...
This paper presents a set of proposals for the OpenMP shared-- memory programming model oriented to...
Today's supercomputers often consists of clusters of SMP nodes. Both OpenMP and MPI are programming ...
The thesis offers a comparison of OpenMP and Intel Threading Building blocks. The two are threading ...
In this paper, we present the first system that implements OpenMP on a network of shared-memory mult...
In this paper we will make an experimental description of the parallel programming using OpenMP. Usi...
Loop-based parallelism is a common in scientific codes. OpenMP proposes such work-sharing construct ...
The efficient iteration of neighbouring particles is a performance critical aspect of any high perfo...
The mixing of shared memory and message passing programming models within a single application has o...
Abstract—Comparison between OpenMP for thread programming model and MPI for message passing programm...
This paper discusses the implementation of particle based numerical methods on multi-core machines. ...
In this paper we discuss the use of nested parallelism. Our claim is that if the problem naturally p...
Circuit-level simulation is a computationally intensive problem that has proven to be particularly d...
The Configuration Interaction (CI) method has been widely used to solve the non-relativistic many-bo...
MCBEND is a general purpose radiation transport Monte Carlo code from AMEC Foster Wheelers’s ANSWERS...
This paper presents a set of proposals for the OpenMP shared--memory programming model oriented tow...
This paper presents a set of proposals for the OpenMP shared-- memory programming model oriented to...
Today's supercomputers often consists of clusters of SMP nodes. Both OpenMP and MPI are programming ...
The thesis offers a comparison of OpenMP and Intel Threading Building blocks. The two are threading ...
In this paper, we present the first system that implements OpenMP on a network of shared-memory mult...
In this paper we will make an experimental description of the parallel programming using OpenMP. Usi...
Loop-based parallelism is a common in scientific codes. OpenMP proposes such work-sharing construct ...
The efficient iteration of neighbouring particles is a performance critical aspect of any high perfo...
The mixing of shared memory and message passing programming models within a single application has o...
Abstract—Comparison between OpenMP for thread programming model and MPI for message passing programm...
This paper discusses the implementation of particle based numerical methods on multi-core machines. ...
In this paper we discuss the use of nested parallelism. Our claim is that if the problem naturally p...
Circuit-level simulation is a computationally intensive problem that has proven to be particularly d...
The Configuration Interaction (CI) method has been widely used to solve the non-relativistic many-bo...
MCBEND is a general purpose radiation transport Monte Carlo code from AMEC Foster Wheelers’s ANSWERS...