The life-cycle of a partial differential equation (PDE) solver is often characterized by three development phases: the development of a stable numerical discretization; development of a correct (verified) implementation; and the optimization of the implementation for different computer architectures. Often it is only after significant time and effort has been invested that the performance bottlenecks of a PDE solver are fully understood, and the precise details varies between different computer architectures. One way to mitigate this issue is to establish a reliable performance model that allows a numerical analyst to make reliable predictions of how well a numerical method would perform on a given computer architecture, before embarking up...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
International audienceThe roofline model is a popular approach to ``bounds and bottleneck''performan...
This paper will show a comparison between the Kepler, Maxwell and Pascal GPU architectures using CUD...
The continued development of improved algorithms and architecture for numerical simulations is at th...
Future architectures designed to deliver exascale performance motivate the need for novel algorithmi...
AbstractFuture architectures designed to deliver exascale performance motivate the need for novel al...
Processor technology is dramatically advancing and promises enormous improvements in processing data...
This dissertation studies the sources of poor performance in scientific computing codes based on par...
This paper is concerned with modeling the perfor-mance of high-order finite-difference schemes for h...
Distributed systems such as networks of workstations are becoming an increasingly viable alternative...
The development of simulation tools is becoming an important area in industry, recently fostered b...
This thesis documents the analysis and optimization of a high-order finite difference computational ...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
International audiencePower consumption and energy efficiency are becoming critical aspects in the d...
This project serves to apply numerical methods to obtain solutions for the wave equation. Paralleliz...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
International audienceThe roofline model is a popular approach to ``bounds and bottleneck''performan...
This paper will show a comparison between the Kepler, Maxwell and Pascal GPU architectures using CUD...
The continued development of improved algorithms and architecture for numerical simulations is at th...
Future architectures designed to deliver exascale performance motivate the need for novel algorithmi...
AbstractFuture architectures designed to deliver exascale performance motivate the need for novel al...
Processor technology is dramatically advancing and promises enormous improvements in processing data...
This dissertation studies the sources of poor performance in scientific computing codes based on par...
This paper is concerned with modeling the perfor-mance of high-order finite-difference schemes for h...
Distributed systems such as networks of workstations are becoming an increasingly viable alternative...
The development of simulation tools is becoming an important area in industry, recently fostered b...
This thesis documents the analysis and optimization of a high-order finite difference computational ...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
International audiencePower consumption and energy efficiency are becoming critical aspects in the d...
This project serves to apply numerical methods to obtain solutions for the wave equation. Paralleliz...
This paper discusses the implementation of one-factor and three-factor PDE models on GPUs. Both expl...
International audienceThe roofline model is a popular approach to ``bounds and bottleneck''performan...
This paper will show a comparison between the Kepler, Maxwell and Pascal GPU architectures using CUD...