We provide timing results for common linear algebra subroutines across BLAS (Basic Linear Algebra Subprograms) and GPU (Graphics Processing Unit)-based implementations. Several BLAS implementations are compared. The first is the unoptimised reference BLAS, which provides a baseline to measure against. Second is the Atlas tuned BLAS, configured for single-threaded mode. Third is the development version of Atlas, configured for multi-threaded mode. Fourth is the optimised and multi-threaded Goto BLAS. Fifth is the multi-threaded BLAS contained in the commercial Intel MKL package. We also measure the performance of a GPU-based implementation for R (R Development Core Team 2010a) provided by the package gputools (Buckner et al. 2010). Several f...
The Basic Linear Algebra Subprograms, BLAS, are the basic computational kernels in most ap...
This dataset contains the execution time of four BLAS Level 1 operations - ASUM, DOT, SCAL and AXPY ...
We propose two high-level application programming interfaces (APIs) to use a graphics processing uni...
Timing results for BLAS (Basic Linear Algebra Subprograms) libraries in R on diverse CPUs and GPUs. ...
A current trend in high-performance computing is to decompose a large linear algebra problem into ba...
BLAS benchmark results for: CPUs: Intel Core i7-4790K, Intel Core i5-4590, Intel Core i5-4590, Inte...
In the last ten years, GPUs have dominated the market considering the computin...
The increase in performance of the last generations of graphics processors (GPUs) has made this clas...
This work reviews the experience of implementing different versions of the SSPR rank-one update oper...
Nowadays GPUs have dominated the market considering the computing/power metric...
Basic Linear Algebra Subprograms (BLAS) and Linear Algebra Package (LAPACK) form basic building bloc...
Implementations of the Basic Linear Algebra Subprograms (BLAS) interface are major buildin...
BLIS is a new software framework for instantiating high-performance BLAS-like dense linear algebra l...
We present several algorithms to compute the solution of a linear system of equations on a GPU, as ...
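For readers unfamiliar with the four BLAS Level 1 operations named in the dataset description above (ASUM, DOT, SCAL and AXPY), the following is a minimal pure-Python sketch of what each one computes, together with a simple best-of-N wall-clock timer. It is illustrative only: it is not the code used to produce any of the listed datasets or benchmarks, the function names and the `time_call` helper are hypothetical, and a tuned BLAS would be orders of magnitude faster than these loops.

```python
import time


def asum(x):
    """ASUM: sum of absolute values, sum_i |x_i|."""
    return sum(abs(v) for v in x)


def dot(x, y):
    """DOT: inner product, sum_i x_i * y_i."""
    return sum(a * b for a, b in zip(x, y))


def scal(alpha, x):
    """SCAL: scale a vector in place, x <- alpha * x."""
    for i in range(len(x)):
        x[i] *= alpha


def axpy(alpha, x, y):
    """AXPY: scaled vector addition in place, y <- alpha * x + y."""
    for i in range(len(y)):
        y[i] += alpha * x[i]


def time_call(fn, *args, repeats=5):
    """Hypothetical helper: best wall-clock time in seconds over `repeats` runs."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn(*args)
        best = min(best, time.perf_counter() - start)
    return best


if __name__ == "__main__":
    n = 100_000  # vector length chosen arbitrarily for illustration
    x = [1.0] * n
    y = [2.0] * n
    print("asum:", time_call(asum, x))
    print("dot: ", time_call(dot, x, y))
    print("scal:", time_call(scal, 1.0, x))
    print("axpy:", time_call(axpy, 0.0, x, y))
```

Taking the minimum over several repeats is one common way to reduce noise from other processes; published benchmarks may instead report means or medians, so the timing policy here should be read as an assumption rather than a description of the listed works.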