International audienceIn this work, numerical algebraic operations are performed by using several libraries whose algorithm are optimized to drain resources from hardware architecture. In particular, dot product of two vectors and the matrix-matrix product of two dense matrices are computed. In addition, the Cholesky decomposition on a real, symmetric, and positive definite matrix is performed through routines for band and sparse matrix storage. The involved CPU time is used as an indicator of the performance of the employed numerical tool. Results are compared to naive implementations of the same numerical algorithm, highlighting the speed-up due to the usage of optimized routines
The bulk synchronous parallel (BSP) model promises scalable and portable software for a wide range o...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
In this thesis, I explore an approach called "active libraries". These are libraries that take part ...
AbstractThe increasing availability of advanced-architecture computers has a significant effect on a...
This dissertation incorporates two research projects: performance modeling and prediction for dense ...
We survey general techniques and open problems in numerical linear algebra on parallel architectures...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
In this paper we consider the data distribution and data movement issues related to the solution of ...
This report has been developed over the work done in the deliverable [Nava94] There it was shown tha...
The efficiency of numerical libraries for a given computation is highly dependent on the size of the...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
This paper discusses the scalability of Cholesky, LU, and QR factorization routines on MIMD distribu...
This paper discusses the design of linear algebra libraries for high performance computers. Particul...
The bulk synchronous parallel (BSP) model promises scalable and portable software for a wide range o...
Matrix computations lie at the heart of most scientific computational tasks. The solution of linear ...
The bulk synchronous parallel (BSP) model promises scalable and portable software for a wide range o...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
In this thesis, I explore an approach called "active libraries". These are libraries that take part ...
AbstractThe increasing availability of advanced-architecture computers has a significant effect on a...
This dissertation incorporates two research projects: performance modeling and prediction for dense ...
We survey general techniques and open problems in numerical linear algebra on parallel architectures...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
In this paper we consider the data distribution and data movement issues related to the solution of ...
This report has been developed over the work done in the deliverable [Nava94] There it was shown tha...
The efficiency of numerical libraries for a given computation is highly dependent on the size of the...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
This paper discusses the scalability of Cholesky, LU, and QR factorization routines on MIMD distribu...
This paper discusses the design of linear algebra libraries for high performance computers. Particul...
The bulk synchronous parallel (BSP) model promises scalable and portable software for a wide range o...
Matrix computations lie at the heart of most scientific computational tasks. The solution of linear ...
The bulk synchronous parallel (BSP) model promises scalable and portable software for a wide range o...
The recent dramatic progress in machine learning is partially attributed to the availability of high...
In this thesis, I explore an approach called "active libraries". These are libraries that take part ...