We present several algorithms to compute the solution of a linear system of equations on a graphics processor (GPU), as well as general techniques to improve their performance, such as padding and hybrid GPU-CPU computation. We compare single and double precision performance of a modern GPU with unified architecture, and show how iterative refinement with mixed precision can be used to regain full accuracy in the solution of linear systems, exploiting the potential of the processor for single precision arithmetic. Experimental results on a GTX280 using CUBLAS 2.0, the implementation of BLAS for NVIDIA® GPUs with unified architecture, illustrate the performance of the different algorithms and techniques proposed
General matrix-matrix multiplications with double-precision real and complex entries (DGEMM and ZGEM...
Abstract Optimization algorithms are becoming increasingly more important in many areas, such as fin...
International audienceThe Simplex algorithm is a well known method to solve linear programming (LP) ...
We present several algorithms to compute the solution of a linear system of equa-tions on a GPU, as ...
We present several algorithms to compute the solution of a linear system of equations on a GPU, as ...
Modern graphics processing units (GPUs) have been at the leading edge of in-creasing chip-level para...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
By using a combination of 32-bit and 64-bit floating point arithmetic, the performance of many dense...
By using a combination of 32-bit and 64-bit floating point arithmetic, the per-formance of many dens...
Graphical Processing Units (GPUs) have become more accessible peripheral devices with great computin...
Graphical Processing Units (GPUs) have become more accessible peripheral devices with great computin...
In this paper we discuss about our experiences in improving the performance of two key algorithms: t...
Abstract. Implementations of the Basic Linear Algebra Subprograms (BLAS) interface are major buildin...
We propose two high-level application programming interfaces (APIs) to use a graphics processing uni...
to appearInternational audienceA wide class of numerical methods needs to solve a linear system, whe...
General matrix-matrix multiplications with double-precision real and complex entries (DGEMM and ZGEM...
Abstract Optimization algorithms are becoming increasingly more important in many areas, such as fin...
International audienceThe Simplex algorithm is a well known method to solve linear programming (LP) ...
We present several algorithms to compute the solution of a linear system of equa-tions on a GPU, as ...
We present several algorithms to compute the solution of a linear system of equations on a GPU, as ...
Modern graphics processing units (GPUs) have been at the leading edge of in-creasing chip-level para...
International audienceBy using a combination of 32-bit and 64-bit floating point arithmetic, the per...
By using a combination of 32-bit and 64-bit floating point arithmetic, the performance of many dense...
By using a combination of 32-bit and 64-bit floating point arithmetic, the per-formance of many dens...
Graphical Processing Units (GPUs) have become more accessible peripheral devices with great computin...
Graphical Processing Units (GPUs) have become more accessible peripheral devices with great computin...
In this paper we discuss about our experiences in improving the performance of two key algorithms: t...
Abstract. Implementations of the Basic Linear Algebra Subprograms (BLAS) interface are major buildin...
We propose two high-level application programming interfaces (APIs) to use a graphics processing uni...
to appearInternational audienceA wide class of numerical methods needs to solve a linear system, whe...
General matrix-matrix multiplications with double-precision real and complex entries (DGEMM and ZGEM...
Abstract Optimization algorithms are becoming increasingly more important in many areas, such as fin...
International audienceThe Simplex algorithm is a well known method to solve linear programming (LP) ...