This white-paper reports on our efforts to enable an SPH-based Fortran code on the Intel Xeon Phi. As a result of the work described here , the two most computationally intensive subroutines (rates and shepard_beta) of the UCD-SPH code were refactored and parallelised with OpenMP for the first time, enabling the code to be executed on multi-core and many-core shared memory systems. This parallelisation achieved speedups of up to 4.3x for the rates subroutine and 6.0x for the shepard_beta subroutine resulting in overall speedups of up to 4.2x on a 2 processor Sandy Bridge Xeon E5 machine. The code was subsequently enabled and refactored to execute in different modes on the Intel Xeon Phi co-processor achieving speedups of up to 2.8x for the ...
The Intel R Xeon PhiTM is the first processor based on Intel’s MIC (Many Integrated Cores) architect...
After at least a decade of parallel tool development, parallelization of scientific applications rem...
With the increasing size and complexity of data produced by large scale numerical simulations, it is...
This white-paper reports on our efforts to enable an SPH-based Fortran code on the Intel Xeon Phi. A...
11 pages, 10 figures, 9 references. Other author's papers can be downloaded at http://www.denys-duty...
This white paper reports on ours e orts to optimize a 2D/3D astrophysical (magento-)hydrodynamics Fo...
CTH is a family of codes developed at Sandia National Laboratories for use in modeling complex multi...
The whitepaper reports our investigation into the porting, optimization and subsequent performance o...
The goal of this lab exercise is to develop a parallel compute-intensive application to be run on an...
This paper contains two parts revolving around Monte Carlo transport simulation on Intel Many Integr...
A serial source code for simulating a supersonic ejector flow is accelerated using parallelization b...
Numerical simulations of fluids in astrophysics and computational fluid dynamics (CFD) are among the...
<p>Fortran MPI benchmark for MPI sum reductions for poster & extended abstract "Accelerating MPI Red...
AbstractThe computational performance of a smoothed particle hydrodynamics (SPH) simulation is inves...
We have obtained a dedicated computational cluster of eight DEC Alpha systems interconnected by 100 ...
The Intel R Xeon PhiTM is the first processor based on Intel’s MIC (Many Integrated Cores) architect...
After at least a decade of parallel tool development, parallelization of scientific applications rem...
With the increasing size and complexity of data produced by large scale numerical simulations, it is...
This white-paper reports on our efforts to enable an SPH-based Fortran code on the Intel Xeon Phi. A...
11 pages, 10 figures, 9 references. Other author's papers can be downloaded at http://www.denys-duty...
This white paper reports on ours e orts to optimize a 2D/3D astrophysical (magento-)hydrodynamics Fo...
CTH is a family of codes developed at Sandia National Laboratories for use in modeling complex multi...
The whitepaper reports our investigation into the porting, optimization and subsequent performance o...
The goal of this lab exercise is to develop a parallel compute-intensive application to be run on an...
This paper contains two parts revolving around Monte Carlo transport simulation on Intel Many Integr...
A serial source code for simulating a supersonic ejector flow is accelerated using parallelization b...
Numerical simulations of fluids in astrophysics and computational fluid dynamics (CFD) are among the...
<p>Fortran MPI benchmark for MPI sum reductions for poster & extended abstract "Accelerating MPI Red...
AbstractThe computational performance of a smoothed particle hydrodynamics (SPH) simulation is inves...
We have obtained a dedicated computational cluster of eight DEC Alpha systems interconnected by 100 ...
The Intel R Xeon PhiTM is the first processor based on Intel’s MIC (Many Integrated Cores) architect...
After at least a decade of parallel tool development, parallelization of scientific applications rem...
With the increasing size and complexity of data produced by large scale numerical simulations, it is...