The increasing complexity of new parallel architectures has widened the gap between adaptability and efficiency of the codes. As high performance numerical libraries tend to focus more on performance, we wish to address this issue using a C++ library called NT2. By analyzing the properties of the linear algebra domain that can be extracted from numerical libraries and combining them with architectural features, we developed a generic approach to solve dense linear systems on various architectures including CPU and GPU. We have then extended our work with an example of a least squares solver based on semi-normal equations in mixed precision that cannot be found in current libraries. For the automatically generated solvers, we report performa...
Get to know two different techniques in retrieving parallelism hidden in a general purpose linear pr...
We present several algorithms to compute the solution of a linear system of equations on a graphics ...
International audienceIn this talk we will discuss our research activities on the design of parallel...
The increasing complexity of new parallel architectures has widened the gap between adaptability and...
International audienceThe increasing complexity of new parallel architectures has widened the gap be...
We present several algorithms to compute the solution of a linear system of equa-tions on a GPU, as ...
Enabling large scale use of GPU-based architectures for high performance computational science depen...
Parallelism in today's computer architectures is ubiquitous whether it be in supercomputers, worksta...
to appearInternational audienceA wide class of numerical methods needs to solve a linear system, whe...
The original publication is available at www.springerlink.comInternational audienceA wide class of g...
We propose two high-level application programming interfaces (APIs) to use a graphics processing uni...
We present a new C++ library design for linear algebra computations on high performance architecture...
Les architectures parallèles sont aujourd'hui présentes dans tous les systèmes informatiques, allant...
International audienceDirect and iterative methods are often used to solve linear systems in enginee...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
Get to know two different techniques in retrieving parallelism hidden in a general purpose linear pr...
We present several algorithms to compute the solution of a linear system of equations on a graphics ...
International audienceIn this talk we will discuss our research activities on the design of parallel...
The increasing complexity of new parallel architectures has widened the gap between adaptability and...
International audienceThe increasing complexity of new parallel architectures has widened the gap be...
We present several algorithms to compute the solution of a linear system of equa-tions on a GPU, as ...
Enabling large scale use of GPU-based architectures for high performance computational science depen...
Parallelism in today's computer architectures is ubiquitous whether it be in supercomputers, worksta...
to appearInternational audienceA wide class of numerical methods needs to solve a linear system, whe...
The original publication is available at www.springerlink.comInternational audienceA wide class of g...
We propose two high-level application programming interfaces (APIs) to use a graphics processing uni...
We present a new C++ library design for linear algebra computations on high performance architecture...
Les architectures parallèles sont aujourd'hui présentes dans tous les systèmes informatiques, allant...
International audienceDirect and iterative methods are often used to solve linear systems in enginee...
Extended version of EuroGPU symposium article, in the International Conference on Parallel Computing...
Get to know two different techniques in retrieving parallelism hidden in a general purpose linear pr...
We present several algorithms to compute the solution of a linear system of equations on a graphics ...
International audienceIn this talk we will discuss our research activities on the design of parallel...