Recently, we have benchmarked and tuned the MILC code on a number of architectures including Intel Itanium and Pentium IV (PIV), dual-CPU Athlon, and the latest Compaq Alpha nodes. Results will be presented for many of these, and we shall discuss some simple code changes that can result in a very dramatic speedup of the KS conjugate gradient on processors with more advanced memory systems such as PIV, IBM SP and Alpha. 1
Reconfigurable computing can significantly improve the performance and energy efficiency of many app...
SIGLEAvailable from British Library Document Supply Centre- DSC:0678.231F(AD-A--193940)(microfiche) ...
Average computation time per particle and integration step for the benchmark system of Sec. Performa...
Recently, we have benchmarked and tuned the MILC code on a number of architectures including Intel I...
With recent developments in parallel supercomputing architecture, many core, multi-core, and GPU pro...
We review our work done to optimize the staggered conjugate gradient (CG) algorithm in the MILC code...
ISC High Performance 2016 International Workshops: Application Performance on Intel Xeon Phi – Being...
The MIMD Lattice Computation (MILC) code (version 7.4.0) is a set of codes developed by the MIMD Lat...
The High-Performance Conjugate Gradient (HPCG) benchmark complements the LINPACK benchmark in the pe...
The landscape of HPC architectures has undergone signi cant change in the last few years. Notably, t...
In this session we show, in two case studies, how the roofline feature of Intel Advisor has been uti...
The performance of conjugate gradient (CG) algorithms for the solution of the system of linear equat...
Fast Illinois solver code (FISC) requirements about memory and central processing unit (CPU) time ar...
The slope is an important distance parameter for a convolutional code. It can be used to obtain a lo...
FPGA devices used in the HPC context promise an increased energy efficiency, enhancing the computing...
Reconfigurable computing can significantly improve the performance and energy efficiency of many app...
SIGLEAvailable from British Library Document Supply Centre- DSC:0678.231F(AD-A--193940)(microfiche) ...
Average computation time per particle and integration step for the benchmark system of Sec. Performa...
Recently, we have benchmarked and tuned the MILC code on a number of architectures including Intel I...
With recent developments in parallel supercomputing architecture, many core, multi-core, and GPU pro...
We review our work done to optimize the staggered conjugate gradient (CG) algorithm in the MILC code...
ISC High Performance 2016 International Workshops: Application Performance on Intel Xeon Phi – Being...
The MIMD Lattice Computation (MILC) code (version 7.4.0) is a set of codes developed by the MIMD Lat...
The High-Performance Conjugate Gradient (HPCG) benchmark complements the LINPACK benchmark in the pe...
The landscape of HPC architectures has undergone signi cant change in the last few years. Notably, t...
In this session we show, in two case studies, how the roofline feature of Intel Advisor has been uti...
The performance of conjugate gradient (CG) algorithms for the solution of the system of linear equat...
Fast Illinois solver code (FISC) requirements about memory and central processing unit (CPU) time ar...
The slope is an important distance parameter for a convolutional code. It can be used to obtain a lo...
FPGA devices used in the HPC context promise an increased energy efficiency, enhancing the computing...
Reconfigurable computing can significantly improve the performance and energy efficiency of many app...
SIGLEAvailable from British Library Document Supply Centre- DSC:0678.231F(AD-A--193940)(microfiche) ...
Average computation time per particle and integration step for the benchmark system of Sec. Performa...