Dense linear algebra computations are essential to nearly every problem in scientific computing and to countless other fields. Most matrix computations enjoy a high computational intensity (i.e., ratio of computation to data), and therefore the algorithms for the computations have a potential for high efficiency. However, performance for many linear algebra algorithms is limited by the cost of moving data between processors on a parallel computer or throughout the memory hierarchy of a single processor, which we will refer to generally as communication. Technological trends indicate that algorithmic performance will become even more limited by communication in the future. In this thesis, we consider the fundamental computations within d...
Graph algorithms typically have very low computational intensities, hence their execution times are ...
This electronic version was submitted by the student author. The certified thesis is available in th...
In this paper, we study the implementation of dense linear algebra kernels, such as matrix multiplic...
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of...
Algorithms have two costs: arithmetic and communication. The latter represents the cost of moving da...
We present lower bounds on the amount of communication that matrix multiplication algorithms must pe...
Parallel matrix multiplication is one of the most studied fun-damental problems in distributed and h...
This thesis targets the design of parallelizable algorithms and communication-efficient parallel sch...
To efficiently scale dense linear algebra problems to future exascale systems, communication cost mu...
Parallel matrix multiplication is one of the most studied fun-damental problems in distributed and h...
International audienceModern, massively parallel computers play a fundamental role in a large and ra...
A parallel algorithm has perfect strong scaling if its running time on P processors is linear in 1/P...
Thesis (M.S.)--Wichita State University, College of Engineering, Dept. of Electrical Engineering and...
Abstract—To efficiently scale dense linear algebra problems to future exascale systems, communicatio...
The movement of data (communication) between levels of a memory hierarchy, or between parallel proce...
Graph algorithms typically have very low computational intensities, hence their execution times are ...
This electronic version was submitted by the student author. The certified thesis is available in th...
In this paper, we study the implementation of dense linear algebra kernels, such as matrix multiplic...
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of...
Algorithms have two costs: arithmetic and communication. The latter represents the cost of moving da...
We present lower bounds on the amount of communication that matrix multiplication algorithms must pe...
Parallel matrix multiplication is one of the most studied fun-damental problems in distributed and h...
This thesis targets the design of parallelizable algorithms and communication-efficient parallel sch...
To efficiently scale dense linear algebra problems to future exascale systems, communication cost mu...
Parallel matrix multiplication is one of the most studied fun-damental problems in distributed and h...
International audienceModern, massively parallel computers play a fundamental role in a large and ra...
A parallel algorithm has perfect strong scaling if its running time on P processors is linear in 1/P...
Thesis (M.S.)--Wichita State University, College of Engineering, Dept. of Electrical Engineering and...
Abstract—To efficiently scale dense linear algebra problems to future exascale systems, communicatio...
The movement of data (communication) between levels of a memory hierarchy, or between parallel proce...
Graph algorithms typically have very low computational intensities, hence their execution times are ...
This electronic version was submitted by the student author. The certified thesis is available in th...
In this paper, we study the implementation of dense linear algebra kernels, such as matrix multiplic...