Emerging distributed applications, such as big data analytics, generate a large number of flows that concurrently transport data across data center networks. To improve their performance, it is required to account for the behavior of such a collection of flows, i.e., coflows, rather than individual ones. State-of-the-art solutions achieve near-optimal completion time by continuously reordering unfinished coflows at the end-host and using network priorities.This paper shows that dynamically changing flow priorities at the end-host, without considering in-flight packets, can cause high degrees of packet reordering, thus imposing pressure on the congestion control and potentially harming network performance in the presence of switches with sha...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
Coflow is a recently proposed network abstraction to capture communication patterns in data centers....
Thanks to the exponential growth of data that needs to be processed in cloud datacenters, data paral...
Emerging distributed applications, such as big data analytics, generate a large number of flows that...
Over the past decade, the confluence of an unprecedented growth in data volumes and the rapid rise o...
Abstract — In the data flow models of today’s data center applications such as MapReduce, Spark and ...
© 2018 IEEE. Many datacenters usually process complex jobs such as MapReduce jobs. From a network pe...
Data parallel applications in data centers generate, process, and store huge volumes of data. Coflow...
International audienceDatacenter networks routinely support the data transfers of distributed comput...
In current data centers, an application (e.g., MapReduce, Dryad, search platform, etc.) usually gene...
Abstract—In the data flow models of today’s data center applications such as MapReduce, Spark and Dr...
This electronic version was submitted by the student author. The certified thesis is available in th...
Communication in data-parallel applications often involves a col-lection of parallel flows. Traditio...
Datacenter networks routinely support the data transfers of distributed computing frameworks in the ...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
Coflow is a recently proposed network abstraction to capture communication patterns in data centers....
Thanks to the exponential growth of data that needs to be processed in cloud datacenters, data paral...
Emerging distributed applications, such as big data analytics, generate a large number of flows that...
Over the past decade, the confluence of an unprecedented growth in data volumes and the rapid rise o...
Abstract — In the data flow models of today’s data center applications such as MapReduce, Spark and ...
© 2018 IEEE. Many datacenters usually process complex jobs such as MapReduce jobs. From a network pe...
Data parallel applications in data centers generate, process, and store huge volumes of data. Coflow...
International audienceDatacenter networks routinely support the data transfers of distributed comput...
In current data centers, an application (e.g., MapReduce, Dryad, search platform, etc.) usually gene...
Abstract—In the data flow models of today’s data center applications such as MapReduce, Spark and Dr...
This electronic version was submitted by the student author. The certified thesis is available in th...
Communication in data-parallel applications often involves a col-lection of parallel flows. Traditio...
Datacenter networks routinely support the data transfers of distributed computing frameworks in the ...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
Coflow is a recently proposed network abstraction to capture communication patterns in data centers....
Thanks to the exponential growth of data that needs to be processed in cloud datacenters, data paral...