International audienceIn this paper, we present a method for overlapping communications on parallel computers for pipelined algorithms. We first introduce a general theoretical model which leads to a generic computation scheme for the optimal packet size. Then, we use the OPIUM library, which provides an easy-to-use and efficient way to compute, in the general case, this optimal packet size, on the column $LU$ factorization; the implementation and performance measures are made on an Intel Paragon
. Efficient communication in networks is a prerequisite to exploit the performance of large parallel...
[[abstract]]We discuss how to design parallel algorithms based upon the divide-and-conquer strategy....
[[abstract]]The basic concept of piplined data-parallel algorithms is introduced by contrasting the ...
International audienceIn this paper, we present a method for overlapping communications on parallel ...
This paper presents some works on the LU factorization from the ScaLAPACK library. First, a complexi...
This paper presents a parallel LU factorization algorithm designed to take advantage of physical bro...
In this book chapter, the authors discuss some important communication issues to obtain a highly sca...
In this paper, we study the various communication algorithms on the pipeline multicomputer. We show ...
Pipelining is normally associated with shared memory and vector computers and rarely used as an algo...
Parallelizing sparse irregular application on distributed memory systems poses serious scalability c...
The aim of this thesis is the study of different methods to minimize the communication overhead due ...
In order to have an optimal execution time of a program running on a multiprocessor system, the pro...
In this paper, we make efficient use of pipelining on LU decomposition with pivoting and a column-sc...
[[abstract]]A methodology for designing pipelined data-parallel algorithms on multicomputers is stud...
Among the various paradigms of parallelization, pipeline parallelism has the advantage of maintainin...
. Efficient communication in networks is a prerequisite to exploit the performance of large parallel...
[[abstract]]We discuss how to design parallel algorithms based upon the divide-and-conquer strategy....
[[abstract]]The basic concept of piplined data-parallel algorithms is introduced by contrasting the ...
International audienceIn this paper, we present a method for overlapping communications on parallel ...
This paper presents some works on the LU factorization from the ScaLAPACK library. First, a complexi...
This paper presents a parallel LU factorization algorithm designed to take advantage of physical bro...
In this book chapter, the authors discuss some important communication issues to obtain a highly sca...
In this paper, we study the various communication algorithms on the pipeline multicomputer. We show ...
Pipelining is normally associated with shared memory and vector computers and rarely used as an algo...
Parallelizing sparse irregular application on distributed memory systems poses serious scalability c...
The aim of this thesis is the study of different methods to minimize the communication overhead due ...
In order to have an optimal execution time of a program running on a multiprocessor system, the pro...
In this paper, we make efficient use of pipelining on LU decomposition with pivoting and a column-sc...
[[abstract]]A methodology for designing pipelined data-parallel algorithms on multicomputers is stud...
Among the various paradigms of parallelization, pipeline parallelism has the advantage of maintainin...
. Efficient communication in networks is a prerequisite to exploit the performance of large parallel...
[[abstract]]We discuss how to design parallel algorithms based upon the divide-and-conquer strategy....
[[abstract]]The basic concept of piplined data-parallel algorithms is introduced by contrasting the ...