Abstract: In the data flow models of today’s data center applications such as MapReduce, Spark and Dryad, multiple flows can semantically comprise a coflow group. Only the completion of all flows in a coflow is meaningful to an application. To optimize application performance, routing and scheduling must be considered jointly at the level of a coflow rather than of individual flows. However, prior solutions have a significant limitation: they consider only scheduling, which is insufficient. To this end, we present RAPIER, a coflow-aware network optimization framework that seamlessly integrates routing and scheduling for better application performance. Using a small-scale testbed implementation and large-scale simulations, we demonstrate that RAPIER sign...
Datacenter networks routinely support the data transfers of distributed comput...
Coflow is a recently proposed network abstraction to capture communication patterns in data centers....
Coflow is a network abstraction used to represent communication patterns in data centers. The coflow...
Over the past decade, the confluence of an unprecedented growth in data volumes and the rapid rise o...
In current data centers, an application (e.g., MapReduce, Dryad, search platform, etc.) usually gene...
Data parallel applications in data centers generate, process, and store huge volumes of data. Coflow...
Emerging distributed applications, such as big data analytics, generate a large number of flows that...
© 2018 IEEE. Many datacenters usually process complex jobs such as MapReduce jobs. From a network pe...
Communication in data-parallel applications often involves a collection of parallel flows. Traditio...
This electronic version was submitted by the student author. The certified thesis is available in th...
Thanks to the exponential growth of data that needs to be processed in cloud datacenters, data paral...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...