The bulge-chasing kernel in the small-bulge multi-shift QR algorithm for the non-symmetric dense eigenvalue problem becomes a sequential bottleneck when the QR algorithm is run in parallel on a multicore platform with shared memory. The duration of each kernel invocation is short, but the critical path of the QR algorithm contains a long sequence of calls to the bulge-chasing kernel. We study the problem of parallelizing the bulge-chasing kernel itself across a handful of processor cores in order to reduce the execution time of the critical path. We propose and evaluate a sequence of four algorithms with varying degrees of complexity and verify that a pipelined algorithm with a slowly shifting block column distribution of the Hessenberg mat...
International audienceThis paper describes a new QR factorization algorithm which is especially desi...
International audienceThe advent of multicore processors represents a disruptive event in the histor...
We present the techniques of adaptive blocking and incremental condition estimation which we believ...
The QR algorithm is the method of choice for computing all eigenvalues of a dense nonsymmetric matri...
A novel variant of the parallel QR algorithm for solving dense nonsymmetric eigenvalue problems on h...
Library software implementing a parallel small-bulge multishift QR algorithm with Aggressive Early D...
Library software implementing a parallel small-bulge multishift QR algorithm with Aggressive Early D...
One approach to solving the nonsymmetric eigenvalue problem in parallel is to parallelize the QR alg...
This paper presents two modifications to the multi-shift QR algorithm that significantly increase it...
This paper introduces a new parallel QR decomposition algorithm. The novel load balancing method des...
Bibliography: pages [162] - 163.The parallel QR algorithm of Datta (with and without shifting and de...
The solution of dense systems of linear equations is at the heart of numerical computations. Such sy...
The QR algorithm is one of the three phases in the process of computing the eigenvalues and the eige...
International audienceThis paper describes a new QR factorization algorithm which is especially desi...
The role of larger bulges in the QR algorithm is controversial. Large bulges are infamous for having...
International audienceThis paper describes a new QR factorization algorithm which is especially desi...
International audienceThe advent of multicore processors represents a disruptive event in the histor...
We present the techniques of adaptive blocking and incremental condition estimation which we believ...
The QR algorithm is the method of choice for computing all eigenvalues of a dense nonsymmetric matri...
A novel variant of the parallel QR algorithm for solving dense nonsymmetric eigenvalue problems on h...
Library software implementing a parallel small-bulge multishift QR algorithm with Aggressive Early D...
Library software implementing a parallel small-bulge multishift QR algorithm with Aggressive Early D...
One approach to solving the nonsymmetric eigenvalue problem in parallel is to parallelize the QR alg...
This paper presents two modifications to the multi-shift QR algorithm that significantly increase it...
This paper introduces a new parallel QR decomposition algorithm. The novel load balancing method des...
Bibliography: pages [162] - 163.The parallel QR algorithm of Datta (with and without shifting and de...
The solution of dense systems of linear equations is at the heart of numerical computations. Such sy...
The QR algorithm is one of the three phases in the process of computing the eigenvalues and the eige...
International audienceThis paper describes a new QR factorization algorithm which is especially desi...
The role of larger bulges in the QR algorithm is controversial. Large bulges are infamous for having...
International audienceThis paper describes a new QR factorization algorithm which is especially desi...
International audienceThe advent of multicore processors represents a disruptive event in the histor...
We present the techniques of adaptive blocking and incremental condition estimation which we believ...