This thesis introduces two tools for efficiently accessing the data of a wavefront algorithm in an out-of-core execution context. Such algorithms are easy to parallelize with macro-pipelining techniques, which overlap computation and communication. The first tool builds on the observation that input/output operations perform very poorly in this setting: the data is scattered on disk, and moving it in and out of memory is slow and expensive. The proposed on-disk data layout resolves these issues by accessing data only contiguously. Whereas the first tool describes how to access the data, the second is a synchronization model that describes when to access it. Indeed, concurrent and parallel execution of w...
GDR-GPL. With the slowdown of Moore's law and the end of the frequency race, the performance comes fro...
International audience. We introduce shared-memory parallelism in a parallel distributed-memory solver...
We develop a technique for extracting parallelism from ordinary (sequential) programs. The technique...
This thesis introduces two tools for efficiently accessing the data of a wavefront algorithm in an out-of-c...
This thesis introduces two tools for efficient access to the data of a wavefront algorithm...
International audience. Matrix computation algorithms often exhibit dependencies between neighboring e...
Several numerical computation algorithms exhibit dependences that lead to a wavefront in the computa...
Several numerical computation algorithms exhibit dependences that lead to a wavefront of the computa...
The high parallelism of MPSoC applications increases the need to optimize the synchronization mec...
The solution of sparse systems of linear equations is at the heart of numerous application fields. Wh...
The InteGrade middleware intends to exploit the idle time of computing resources in computer laborat...
The memory usage of sparse direct solvers can be the bottleneck in solving large-scale problems involv...
To use the computational power of modern computing machines, we have to deal with concurrent program...
The authors introduced a performance model for parallel, multidimensional, wavefront calculations wi...
Chandrasekaran, Sunita. Processor architectures have been rapidly evolving for decades. From the intro...