In a recent paper it was shown how memory traffic can be diminished by reformulating the classic algorithm for reducing a matrix to bidiagonal form, a preprocess when computing the singular values of a dense matrix. The key is a reordering of the computation so that the most memory-intensive operations can be “fused.” In this article, we show that other operations that reduce matrices to condensed form (reduction to upper Hessenberg form and reduction to tridiagonal form) can be similarly reorganized, yielding different sets of operations that can be fused. By developing the algorithms with a common framework and notation, we facilitate the comparing and contrasting of the different algorithms and opportunities for optimization on sequentia...
textabstractSolution of large sparse systems of linear equations continues to be a major research ar...
This thesis considers two problems in numerical linear algebra and high performance computing (HPC):...
The goal of the LAPACK project is to provide efficient and portable software for dense numerical lin...
AbstractIn this paper we described block algorithms for the reduction of a real symmetric matrix to ...
In this paper we describe block algorithms for the reduction of a real symmetric matrix to tridiagon...
Most methods for calculating the SVD (singular value decomposition) require to first bidiagonalize t...
We present two variants of Moler and Stewart’s algorithm for reducing a matrix pair to Hessenberg-tr...
Abstract In this paper, a modification of the blocked algorithm for reduction to Hessenberg form is ...
AbstractWe present an idea for reducing a rectangular matrix A to bidiagonal form which is based on ...
We develop two fast algorithms for Hessenberg reduction of a structured matrix $A = D + UV^H$, where...
AbstractThe main purpose of this paper is to present a fast matrix multiplication algorithm taken fr...
Recently an extension of the class of matrices admitting a Francis type of multishift QR algorithm w...
International audienceWe study tiled algorithms for going from a " full " matrix to a condensed " ba...
International audienceWhen solving large sparse linear systems, both the amount of memory needed and...
Abstract. On many high-speed computers the dense matrix technique is preferable to sparse matrix tec...
textabstractSolution of large sparse systems of linear equations continues to be a major research ar...
This thesis considers two problems in numerical linear algebra and high performance computing (HPC):...
The goal of the LAPACK project is to provide efficient and portable software for dense numerical lin...
AbstractIn this paper we described block algorithms for the reduction of a real symmetric matrix to ...
In this paper we describe block algorithms for the reduction of a real symmetric matrix to tridiagon...
Most methods for calculating the SVD (singular value decomposition) require to first bidiagonalize t...
We present two variants of Moler and Stewart’s algorithm for reducing a matrix pair to Hessenberg-tr...
Abstract In this paper, a modification of the blocked algorithm for reduction to Hessenberg form is ...
AbstractWe present an idea for reducing a rectangular matrix A to bidiagonal form which is based on ...
We develop two fast algorithms for Hessenberg reduction of a structured matrix $A = D + UV^H$, where...
AbstractThe main purpose of this paper is to present a fast matrix multiplication algorithm taken fr...
Recently an extension of the class of matrices admitting a Francis type of multishift QR algorithm w...
International audienceWe study tiled algorithms for going from a " full " matrix to a condensed " ba...
International audienceWhen solving large sparse linear systems, both the amount of memory needed and...
Abstract. On many high-speed computers the dense matrix technique is preferable to sparse matrix tec...
textabstractSolution of large sparse systems of linear equations continues to be a major research ar...
This thesis considers two problems in numerical linear algebra and high performance computing (HPC):...
The goal of the LAPACK project is to provide efficient and portable software for dense numerical lin...