. The parallel computer CM-200 consists of a very large number of simple processors connected in a mesh. The peak performance is very high, but it is not clear how easy it is to write efficient programs in high-level languages. A straight-forward implementation of the Householder QR algorithm in CM Fortran is shown to be slow. Another implementation is presented, with better performance, but still not comparable to the low-level implementation in CMSSL. AMS Subject Classification : 65F05, 65F25, 65Y10 Key words: Connection Machine, Matrix Computations, QR-decomposition Matrix Computations on the CM-200 1 Introduction The Connection Machine 200 (CM) is a massively parallel SIMD computer, i.e. a computer with a very large number of simple p...
This paper presents benchmarking results for image processing algorithms on the Connection Machine m...
Some level-2 and level-3 Distributed Basic Linear Algebra Subroutines (DBLAS) that have been impleme...
For the solution of systems of linear equations with general non-Hermitian nonsingular coefficient m...
The main goal of this research is to use OpenMP, Posix Threads and Microsoft Parallel Patterns libra...
Interprocessor communication often dominates the runtime of large matrix computations. We present a ...
Using super-resolution techniques to estimate the direction that a signal arrived at a radio receive...
This report addresses several important aspects of parallel implementation of QR decomposition of a ...
International audienceInterprocessor communication often dominates the runtime of large matrix compu...
International audienceInterprocessor communication often dominates the runtime of large matrix compu...
Parallel computing on networks of workstations are intensively used in some application areas such a...
While parallel computer architectures have become mainstream, application development on them is sti...
While parallel computer architectures have become mainstream, application development on them is sti...
This paper introduces a new parallel QR decomposition algorithm. The novel load balancing method des...
[[abstract]]Numerical algorithm runtimes are increasingly dominated by the cost of communication (me...
This paper presents benchmarking results for image processing algorithms on the Connection Machine m...
This paper presents benchmarking results for image processing algorithms on the Connection Machine m...
Some level-2 and level-3 Distributed Basic Linear Algebra Subroutines (DBLAS) that have been impleme...
For the solution of systems of linear equations with general non-Hermitian nonsingular coefficient m...
The main goal of this research is to use OpenMP, Posix Threads and Microsoft Parallel Patterns libra...
Interprocessor communication often dominates the runtime of large matrix computations. We present a ...
Using super-resolution techniques to estimate the direction that a signal arrived at a radio receive...
This report addresses several important aspects of parallel implementation of QR decomposition of a ...
International audienceInterprocessor communication often dominates the runtime of large matrix compu...
International audienceInterprocessor communication often dominates the runtime of large matrix compu...
Parallel computing on networks of workstations are intensively used in some application areas such a...
While parallel computer architectures have become mainstream, application development on them is sti...
While parallel computer architectures have become mainstream, application development on them is sti...
This paper introduces a new parallel QR decomposition algorithm. The novel load balancing method des...
[[abstract]]Numerical algorithm runtimes are increasingly dominated by the cost of communication (me...
This paper presents benchmarking results for image processing algorithms on the Connection Machine m...
This paper presents benchmarking results for image processing algorithms on the Connection Machine m...
Some level-2 and level-3 Distributed Basic Linear Algebra Subroutines (DBLAS) that have been impleme...
For the solution of systems of linear equations with general non-Hermitian nonsingular coefficient m...