Computing the actions of Wilson-Dirac operators consumes most of the CPU time for the grand challenge problem of simulating Lattice Quantum Chromodynamics (Lattice QCD). This routine exhibits many challenges to implementation on most computational environments because of the multiple pattern of accessing the same data that make it difficult to align the data efficiently at compile time. Additionally, the low computation to memory access ratio makes this computation both memory bandwidth and memory latency bounded. In this work, we present an implementation of this routine on Cell Broadband Engine. We propose runtime data fusion, an approach aiming at aligning data at runtime, for data that cannot be aligned optimally at compile time, to imp...
AbstractWe report our implementation experience of a lattice gauge theory code on the Cell Broadband...
Numerical simulations of theories describing the interaction of elementary particles are a key appr...
In this paper we describe a single-node, double precision Field Programmable Gate Array (FPGA) imple...
Computing the actions of Wilson-Dirac operators consumes most of the CPU time for the grand challeng...
Lattice Quantum Chromodynamic (QCD) models subatomic interactions based on a four-dimensional discre...
International audienceQuantum chromodynamics (QCD) is the theory of subnuclear physics, aiming at mo...
Quantum Chromodynamics (QCD) is the theory of strong nuclear force, responsible of the interactions ...
We revisit the Wilson-Dirac operator, also refered as Dslash, on multicore vector machines. The Wils...
We evaluate IBM’s Enhanced Cell Broadband Engine (BE) as a possible building block of a new generati...
International audienceWe revisit the Wilson-Dirac operator, also referred as Dslash, on NUMA manycor...
The Cell Broadband Engine architecture is a revolutionary processor architecture well suited for man...
Application-driven computers for Lattice Gauge Theory simulations have often been based on system-on...
AbstractWe report our implementation experience of a lattice gauge theory code on the Cell Broadband...
Numerical simulations of theories describing the interaction of elementary particles are a key appr...
In this paper we describe a single-node, double precision Field Programmable Gate Array (FPGA) imple...
Computing the actions of Wilson-Dirac operators consumes most of the CPU time for the grand challeng...
Lattice Quantum Chromodynamic (QCD) models subatomic interactions based on a four-dimensional discre...
International audienceQuantum chromodynamics (QCD) is the theory of subnuclear physics, aiming at mo...
Quantum Chromodynamics (QCD) is the theory of strong nuclear force, responsible of the interactions ...
We revisit the Wilson-Dirac operator, also refered as Dslash, on multicore vector machines. The Wils...
We evaluate IBM’s Enhanced Cell Broadband Engine (BE) as a possible building block of a new generati...
International audienceWe revisit the Wilson-Dirac operator, also referred as Dslash, on NUMA manycor...
The Cell Broadband Engine architecture is a revolutionary processor architecture well suited for man...
Application-driven computers for Lattice Gauge Theory simulations have often been based on system-on...
AbstractWe report our implementation experience of a lattice gauge theory code on the Cell Broadband...
Numerical simulations of theories describing the interaction of elementary particles are a key appr...
In this paper we describe a single-node, double precision Field Programmable Gate Array (FPGA) imple...