In this paper we study the tradeoff between parallelism and communication cost in a map-reduce computation. For any problem that is not “embarrassingly parallel, ” the finer we partition the work of the reducers so that more parallelism can be extracted, the greater will be the total communica-tion between mappers and reducers. We introduce a model of problems that can be solved in a single round of map-reduce computation. This model enables a generic recipe for discovering lower bounds on communication cost as a function of the maximum number of inputs that can be as-signed to one reducer. We use the model to analyze the tradeoff for three problems: finding pairs of strings at Ham-ming distance d, finding triangles and other patterns in a ...
Communication (data movement) often dominates a computation's runtime and energy costs, motivating o...
A parallel algorithm has perfect strong scaling if its running time on P processors is linear in 1/P...
We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling t...
Thesis (M.S.)--Wichita State University, College of Engineering, Dept. of Electrical Engineering and...
We present lower bounds on the amount of communication that matrix multiplication algorithms must pe...
The movement of data (communication) between levels of a memory hierarchy, or between parallel proce...
In this paper we propose a new approach to the study of the communication requirements of distribute...
In this paper we propose a new approach to the study of the communication requirements of distribute...
Sparse matrix operations dominate the cost of many scientific applications. In parallel, the perform...
We give algorithms for geometric graph problems in the modern parallel models such as MapReduce [DG0...
Graph expansion analysis of computational DAGs is useful for obtaining communication cost lower boun...
We study the effect of limited communication throughput on parallel computation in a setting where t...
AbstractWe study the effect of limited communication throughput on parallel computation in a setting...
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of...
This work explores fundamental modeling and algorithmic issues arising in the well-established MapRe...
Communication (data movement) often dominates a computation's runtime and energy costs, motivating o...
A parallel algorithm has perfect strong scaling if its running time on P processors is linear in 1/P...
We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling t...
Thesis (M.S.)--Wichita State University, College of Engineering, Dept. of Electrical Engineering and...
We present lower bounds on the amount of communication that matrix multiplication algorithms must pe...
The movement of data (communication) between levels of a memory hierarchy, or between parallel proce...
In this paper we propose a new approach to the study of the communication requirements of distribute...
In this paper we propose a new approach to the study of the communication requirements of distribute...
Sparse matrix operations dominate the cost of many scientific applications. In parallel, the perform...
We give algorithms for geometric graph problems in the modern parallel models such as MapReduce [DG0...
Graph expansion analysis of computational DAGs is useful for obtaining communication cost lower boun...
We study the effect of limited communication throughput on parallel computation in a setting where t...
AbstractWe study the effect of limited communication throughput on parallel computation in a setting...
Multiplication of a sparse matrix with a dense matrix is a building block of an increasing number of...
This work explores fundamental modeling and algorithmic issues arising in the well-established MapRe...
Communication (data movement) often dominates a computation's runtime and energy costs, motivating o...
A parallel algorithm has perfect strong scaling if its running time on P processors is linear in 1/P...
We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling t...