Linear algebra operations arise in a myriad of scientific and engineering applications and, therefore, their optimization is targeted by a significant number of high performance computing (HPC) research efforts. In particular, the matrix multiplication and the solution of linear systems are two key problems with efficient implementations (or kernels) for a variety of high per- formance parallel architectures. For these specific prob- lems, leveraging the structure of the associated matrices often leads to remarkable time and memory savings, as is the case, e.g., for symmetric band problems. In this work, we exploit the ample hardware concurrency of many-core graphics processors (GPUs) to accelerate the solution of symmetric positive definit...
The emergence of multicore and heterogeneous architectures requires many linear algebra algorithms t...
If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced with accel...
General purpose computing on graphics processing units (GPGPU) is fast becoming a common feature of ...
Abstract. Implementations of the Basic Linear Algebra Subprograms (BLAS) interface are major buildin...
We investigate the performance of the routines in LAPACK and the Successive Band Reduction (SBR) too...
We investigate the performance of the routines in LAPACK and the Successive Band Reduction (SBR) too...
Dense linear algebra(DLA) is one of the most seven important kernels in high performance computing. ...
AbstractThe introduction of auto-tuning techniques in linear algebra routines using hybrid combinati...
Abstract Optimization algorithms are becoming increasingly more important in many areas, such as fin...
General matrix-matrix multiplications (GEMM) in vendor-supplied BLAS libraries are best optimized fo...
In this chapter, we present a hybridization methodology for the development of linear algebra softwa...
Abstract. If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced ...
Communicated by Yasuaki Ito Solution of large-scale dense nonsymmetric eigenvalue problem is require...
We present several algorithms to compute the solution of a linear system of equa-tions on a GPU, as ...
We present several algorithms to compute the solution of a linear system of equations on a graphics ...
The emergence of multicore and heterogeneous architectures requires many linear algebra algorithms t...
If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced with accel...
General purpose computing on graphics processing units (GPGPU) is fast becoming a common feature of ...
Abstract. Implementations of the Basic Linear Algebra Subprograms (BLAS) interface are major buildin...
We investigate the performance of the routines in LAPACK and the Successive Band Reduction (SBR) too...
We investigate the performance of the routines in LAPACK and the Successive Band Reduction (SBR) too...
Dense linear algebra(DLA) is one of the most seven important kernels in high performance computing. ...
AbstractThe introduction of auto-tuning techniques in linear algebra routines using hybrid combinati...
Abstract Optimization algorithms are becoming increasingly more important in many areas, such as fin...
General matrix-matrix multiplications (GEMM) in vendor-supplied BLAS libraries are best optimized fo...
In this chapter, we present a hybridization methodology for the development of linear algebra softwa...
Abstract. If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced ...
Communicated by Yasuaki Ito Solution of large-scale dense nonsymmetric eigenvalue problem is require...
We present several algorithms to compute the solution of a linear system of equa-tions on a GPU, as ...
We present several algorithms to compute the solution of a linear system of equations on a graphics ...
The emergence of multicore and heterogeneous architectures requires many linear algebra algorithms t...
If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced with accel...
General purpose computing on graphics processing units (GPGPU) is fast becoming a common feature of ...