We discuss timing and performance modeling of a routine to find all the eigenvalues and eigenvectors of a dense symmetric matrix on distributed-memory computers. The routine, PDSYEVX, is part of the ScaLAPACK library. It is based on bisection and inverse iteration, but is not designed to guarantee orthogonality of eigenvectors in the presence of clustered eigenvalues. We use our validated performance model to conclude that PDSYEVX is very efficient for sufficiently large problem sizes, nearly independently of input and output data layouts. However, efficiency will be low if interprocessor communication is too slow, such as on conventional workstation networks, or if per-processor memory is too small, such as on the Intel Gamma. Modeling also help...
Solving dense symmetric eigenvalue problems and computing singular value decompositions cont...
This report demonstrates parallel versions of the Eispack functions TRED2 and TQL2 for finding all...
The transputer is a fast microprocessor, unique in its linking ability to provide a framework for bu...
The goal of the PRISM project is the development of infrastructure and algorithms for the parallel s...
In the present work we describe HPEC (High Performance Eigenvalues Computation), a parallel software...
In this paper, we present preliminary results on a complete eigensolver based on the Yau and Lu meth...
In this paper, parallel extensions of a complete symmetric eigensolver, proposed by Yau and Lu in 19...
We give an overview of the Invariant Subspace Decomposition Algorithm for dense symmetric matrices...
This dissertation discusses parallel algorithms for the generalized eigenvalue problem Ax = λBx wher...
We present a new parallel implementation of a divide and conquer algorithm for computing the spectra...
Obtaining the eigenvalues and eigenvectors of large matrices is a key problem in electronic structur...
Complex symmetric matrices often appear in quantum physics in the solution methods of partial differ...