As the discipline of scientific computing grows, so too does the "skills gap" between the increasingly complex scientific applications and the efficient algorithms required. Increasing demand for computational power on the march towards exascale requires innovative approaches. Closing the skills gap avoids the many pitfalls that lead to poor utilisation of resources and wasted investment. This thesis tackles two challenges: asynchronous algorithms for parallel computing and fault tolerance. First I present a novel asynchronous task invocation methodology for Discontinuous Galerkin codes called enclave tasking. The approach modifies the parallel ordering of tasks that allows for efficient scaling on dynamic meshes up to 756 cores. It ensures...
Supercomputers continue to expand both in size and complexity as we reach the beginning of the exasc...
Depuis le milieu des années 1990, les bibliothèques de transmission de messages sont les technologie...
International audienceThis paper describes an environment dedicated to the building of efficient sci...
Many natural and engineering systems are governed by nonlinear partial differential equations (PDEs)...
Asynchronous task-based programming models are gaining popularity to address the programmability and...
Massively parallel supercomputers are susceptible to variable performance due to factors such as di...
Modern computers can no longer rely on increasing CPU speed to improve their performance as further ...
It has become common knowledge that parallel programming is needed for scientific applications, part...
Balancing the workload of sophisticated simulations is inherently difficult, since we have to balanc...
Governments, universities, and companies expend vast resources building the top supercomputers. The...
Advances in machine learning and streaming systems provide a backbone to transform vast arrays of ra...
In the realm of distributed computing, collective operations involve coordinated communication and s...
Asynchronous methods minimize idle times by removing synchronization barriers, and therefore allow t...
Software is becoming increasingly concurrent: parallelization, decentralization, and reactivity nece...
In an era where we can not afford to checkpoint frequently, replication is a generic way forward to ...
Supercomputers continue to expand both in size and complexity as we reach the beginning of the exasc...
Depuis le milieu des années 1990, les bibliothèques de transmission de messages sont les technologie...
International audienceThis paper describes an environment dedicated to the building of efficient sci...
Many natural and engineering systems are governed by nonlinear partial differential equations (PDEs)...
Asynchronous task-based programming models are gaining popularity to address the programmability and...
Massively parallel supercomputers are susceptible to variable performance due to factors such as di...
Modern computers can no longer rely on increasing CPU speed to improve their performance as further ...
It has become common knowledge that parallel programming is needed for scientific applications, part...
Balancing the workload of sophisticated simulations is inherently difficult, since we have to balanc...
Governments, universities, and companies expend vast resources building the top supercomputers. The...
Advances in machine learning and streaming systems provide a backbone to transform vast arrays of ra...
In the realm of distributed computing, collective operations involve coordinated communication and s...
Asynchronous methods minimize idle times by removing synchronization barriers, and therefore allow t...
Software is becoming increasingly concurrent: parallelization, decentralization, and reactivity nece...
In an era where we can not afford to checkpoint frequently, replication is a generic way forward to ...
Supercomputers continue to expand both in size and complexity as we reach the beginning of the exasc...
Depuis le milieu des années 1990, les bibliothèques de transmission de messages sont les technologie...
International audienceThis paper describes an environment dedicated to the building of efficient sci...