The increasing complexity of new parallel architectures has widened the gap between adaptability and efficiency of the codes. As high performance numerical libraries tend to focus more on performance, we wish to address this issue using a C++ library called NT2. By analyzing the properties of the linear algebra domain that can be extracted from numerical libraries and combining them with architectural features, we developed a generic approach to solve dense linear systems on various architectures including CPU and GPU. We have then extended our work with an example of a least squares solver based on semi-normal equations in mixed precision that cannot be found in current libraries. For the automatically generated solve...
Enabling large scale use of GPU-based architectures for high performance computational science depen...
Get to know two different techniques in retrieving parallelism hidden in a general purpose linear pr...
We address some key issues in designing dense linear algebra (DLA) algorithms that are common for bo...
Parallelism in today's computer architectures is ubiquitous whether it be in supercomputers, worksta...
We present several algorithms to compute the solution of a linear system of equations on a GPU, as ...
In a previous PPoPP paper we showed how the FLAME methodology, combined with the SuperMatrix runtim...
Few realize that, for large matrices, many dense matrix computations achieve nearly the sa...
Parallel architectures are now present in all computing systems, ranging...
We propose two high-level application programming interfaces (APIs) to use a graphics processing uni...
A wide class of numerical methods needs to solve a linear system, whe...
Highly structured sparse matrices arise frequently from numerical discretizati...
If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced ...