The polyhedral model makes it possible to automatically improve data locality and enable parallelism in regular linear algebra kernels. In previous work we proposed a new data structure, the 2d-packed layout, which stores only the nonzero elements of dynamically allocated regular sparse (triangular and banded) matrices for different basic linear algebra operations, and we used Pluto to parallelize and optimize them. To our surprise, the measured execution times of these kernels showed large discrepancies that were due to the allocation mode: statically declared arrays versus dynamically allocated arrays of pointers. In this paper we compare the performance of various linear algebra kernels, including some linear algebra kerne...
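The two allocation modes contrasted in the abstract can be sketched in C as follows. This is only an illustrative sketch under stated assumptions: the matrix order `N`, the helper names (`alloc_rows`, `free_rows`, `fill_static`, `fill_rows`, `lower_sum_static`, `lower_sum_rows`) and the lower-triangle sum kernel are all hypothetical and do not come from the paper. It does not reproduce the 2d-packed layout or the Pluto-generated code, and it measures nothing; it only shows what differs between the two declarations.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

#define N 64 /* hypothetical matrix order, chosen only for illustration */

/* Mode 1: statically declared array. One contiguous block whose row
   stride is known at compile time. */
static double a_static[N][N];

/* Mode 2: dynamically allocated array of pointers. Each row is a separate
   heap block, so a[i][j] needs an extra pointer load and rows need not be
   adjacent in memory. Returns NULL on allocation failure. */
double **alloc_rows(size_t n) {
    double **a = malloc(n * sizeof *a);
    if (a == NULL) return NULL;
    for (size_t i = 0; i < n; i++) {
        a[i] = calloc(n, sizeof **a);
        if (a[i] == NULL) {
            while (i > 0) free(a[--i]); /* release rows allocated so far */
            free(a);
            return NULL;
        }
    }
    return a;
}

void free_rows(double **a, size_t n) {
    if (a == NULL) return;
    for (size_t i = 0; i < n; i++) free(a[i]);
    free(a);
}

/* Set every entry of the static array to v. */
void fill_static(double v) {
    for (size_t i = 0; i < N; i++)
        for (size_t j = 0; j < N; j++)
            a_static[i][j] = v;
}

/* Set every entry of the pointer-based array to v. */
void fill_rows(double **a, size_t n, double v) {
    assert(a != NULL);
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++)
            a[i][j] = v;
}

/* The same triangular loop nest on both layouts: sum of the lower
   triangle, diagonal included. */
double lower_sum_static(void) {
    double s = 0.0;
    for (size_t i = 0; i < N; i++)
        for (size_t j = 0; j <= i; j++)
            s += a_static[i][j];
    return s;
}

double lower_sum_rows(double **a, size_t n) {
    assert(a != NULL);
    double s = 0.0;
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j <= i; j++)
            s += a[i][j];
    return s;
}
```

Both kernels compute the same value; only the way the compiler and the memory system see the array changes, which is the variable the abstract identifies as the source of the timing discrepancies.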
We consider the problem of allocating and scheduling dense linear application ...
The polyhedron model has been used successfully for automatic parallelization of code regions with ...
Traditional (sequential) applications use malloc for a variety of dynamic data structures, like link...
The polyhedral model makes it possible to automatically improve data locality and enable...
We consider the problem of data allocation when performing matrix multiplicati...
The tremendous increase in the size and heterogeneity of supercomputers makes it very difficult to p...
This paper addresses static resource allocation problems for irregular distribu...
We study the implementation of dense linear algebra computations, such as matrix multiplicatio...
The tremendous increase in the size and heterogeneity of supercomputers makes ...
In this paper, we deal with redistribution issues for dense linear algebra kernels on heterogeneous ...
We compare four existing dynamic memory allocators optimized for GPUs and show their strengths and w...
In this paper, we deal with redistribution issues for dense linear algebra kernels on heteroge...
Array contraction is a compilation optimization used to reduce memory consumpt...
Nowadays, linear algebra applications are commonly used to deal with proble...
In this paper, we study the implementation of dense linear algebra kernels, such as matrix multiplic...