With the advent of manycore systems, shared memory parallelisation has gained importance in high performance computing. Once a code is decomposed into tasks or parallel regions, it becomes crucial to identify reasonable grain sizes, i.e. minimum problem sizes per task that make the algorithm expose a high concurrency at low overhead. Many papers do not detail what reasonable task sizes are, and consider their findings craftsmanship not worth discussion. We have implemented an autotuning algorithm, a machine learning approach, for a project developing a hyperbolic equation system solver. Autotuning here is important as the grid and task workload are multifaceted and change frequently during runtime. In this paper, we summarise our lessons le...
dissertationSolutions to Partial Di erential Equations (PDEs) are often computed by discretizing the...
Autotuning is an established technique for optimizing the performance of parallel applications. Howe...
As simulation and analytics enter the exascale era, numerical algorithms, particularly implicit solv...
This dissertation studies the sources of poor performance in scientific computing codes based on par...
In this paper, we present Patus, a code generation and auto-tuning framework for stencil computation...
We study the performance behaviour of a seismic simulation using the ExaHyPE engine with a specific ...
Compiler-based auto-parallelization is a much studied area, yet has still not found wide-spread appl...
Manual tuning of applications for heterogeneous parallel systems is tedious and complex. Optimizati...
Compiler-based auto-parallelization is a much studied area, yet has still not found wide-spread appl...
Computer simulations that solve partial differential equations (PDEs) are common in many fields of s...
In prior-research the authors have demonstrated that, for stencil-based numerical solvers for Partia...
dissertationStencil computations are operations on structured grids. They are frequently found in pa...
Parallelisation is becoming more and more important as the single core performance increase is stagn...
Computational Grids lend themselves well to parameter sweep applications,which consist of independen...
AbstractThis paper addresses two key parallelization challenges the unstructured mesh-based ocean mo...
dissertationSolutions to Partial Di erential Equations (PDEs) are often computed by discretizing the...
Autotuning is an established technique for optimizing the performance of parallel applications. Howe...
As simulation and analytics enter the exascale era, numerical algorithms, particularly implicit solv...
This dissertation studies the sources of poor performance in scientific computing codes based on par...
In this paper, we present Patus, a code generation and auto-tuning framework for stencil computation...
We study the performance behaviour of a seismic simulation using the ExaHyPE engine with a specific ...
Compiler-based auto-parallelization is a much studied area, yet has still not found wide-spread appl...
Manual tuning of applications for heterogeneous parallel systems is tedious and complex. Optimizati...
Compiler-based auto-parallelization is a much studied area, yet has still not found wide-spread appl...
Computer simulations that solve partial differential equations (PDEs) are common in many fields of s...
In prior-research the authors have demonstrated that, for stencil-based numerical solvers for Partia...
dissertationStencil computations are operations on structured grids. They are frequently found in pa...
Parallelisation is becoming more and more important as the single core performance increase is stagn...
Computational Grids lend themselves well to parameter sweep applications,which consist of independen...
AbstractThis paper addresses two key parallelization challenges the unstructured mesh-based ocean mo...
dissertationSolutions to Partial Di erential Equations (PDEs) are often computed by discretizing the...
Autotuning is an established technique for optimizing the performance of parallel applications. Howe...
As simulation and analytics enter the exascale era, numerical algorithms, particularly implicit solv...