The efficiency of a pleasingly parallel application is studied on several computing platforms. A real-world problem, namely Monte-Carlo numerical simulation of the drift descent of stratospheric balloon envelopes, is considered. We detail the optimization of the SIMD parallel codes on the K40 and K80 GPUs as well as on the Intel Xeon Phi. We emphasize loop and task parallelism, multi-threading, and vectorization, respectively. The experiments show that GPU and MIC reduce computing time by non-negligible factors compared to a parallel code implemented on a two-socket CPU (E5-2680-v2), which finally allows us to use these devices in operational conditions.
To take advantage of the computational power of GPUs, algorithms that work well on CPUs must be modi...
This paper describes a parallel implementation of the direct simulation Monte Carlo method. Runtime...
We argue that Monte Carlo algorithms are ideally suited to parallel computing, and that “p...
A performance evaluation of parallel Monte-Carlo simulations on GPU and MIC is...
We present a case-study on the utility of graphics cards to perform massively parallel simulation w...
We present a case-study on the utility of graphics cards to perform massively parallel simulation of...
We present a case study on the utility of graphics cards to perform massively parallel simulation of...
This work presents a study of the applicability of a massively parallel computing paradigm to Monte ...
The Monte Carlo neutron transport method can be naturally parallelized by multi-core architectures d...
The efficient parallel implementation of a 3-D Monte Carlo device simulator is described. The parall...
This paper describes a parallel implementation of the direct simulation Monte Carlo (DSMC) method. R...
In this contribution we describe an efficient GPU implementation of the Monte-Carlo simulation of th...
The continual development of sophisticated transport simulation algorithms allows increasingly accur...
The computational performance of a smoothed particle hydrodynamics (SPH) simulation is inves...