Data parallel applications in data centers generate, process, and store huge volumes of data. Coflow Completion time (CCT) is one of the major performance metrics to capture application-level semantics. This paper is the first one to study the joint consideration of task placement, coflow bandwidth scheduling, and path choice to minimize the average CCT in intra-data center. This paper proposes a joint online scheduling framework, which first develops a 2-approximation algorithm to reduce the CCT of a single coflow, and then follows the Shortest Remaining Time First (SRTF) principle to schedule multiple coflows. Extensive simulations based on practical trace demonstrate that the proposed framework has better performance than the state-of-th...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
Datacenter networks routinely support the data transfers of distributed computing frameworks in the ...
Coflow is a network abstraction used to represent communication patterns in data centers. The coflow...
In current data centers, an application (e.g., MapReduce, Dryad, search platform, etc.) usually gene...
Abstract — In the data flow models of today’s data center applications such as MapReduce, Spark and ...
International audienceDatacenter networks routinely support the data transfers of distributed comput...
This electronic version was submitted by the student author. The certified thesis is available in th...
Over the past decade, the confluence of an unprecedented growth in data volumes and the rapid rise o...
Abstract—In the data flow models of today’s data center applications such as MapReduce, Spark and Dr...
Communication in data-parallel applications often involves a col-lection of parallel flows. Traditio...
Coflow is a recently proposed network abstraction to capture communication patterns in data centers....
© 2018 IEEE. Many datacenters usually process complex jobs such as MapReduce jobs. From a network pe...
Emerging distributed applications, such as big data analytics, generate a large number of flows that...
Thanks to the exponential growth of data that needs to be processed in cloud datacenters, data paral...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
Datacenter networks routinely support the data transfers of distributed computing frameworks in the ...
Coflow is a network abstraction used to represent communication patterns in data centers. The coflow...
In current data centers, an application (e.g., MapReduce, Dryad, search platform, etc.) usually gene...
Abstract — In the data flow models of today’s data center applications such as MapReduce, Spark and ...
International audienceDatacenter networks routinely support the data transfers of distributed comput...
This electronic version was submitted by the student author. The certified thesis is available in th...
Over the past decade, the confluence of an unprecedented growth in data volumes and the rapid rise o...
Abstract—In the data flow models of today’s data center applications such as MapReduce, Spark and Dr...
Communication in data-parallel applications often involves a col-lection of parallel flows. Traditio...
Coflow is a recently proposed network abstraction to capture communication patterns in data centers....
© 2018 IEEE. Many datacenters usually process complex jobs such as MapReduce jobs. From a network pe...
Emerging distributed applications, such as big data analytics, generate a large number of flows that...
Thanks to the exponential growth of data that needs to be processed in cloud datacenters, data paral...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
Datacenter networks routinely support the data transfers of distributed computing frameworks in the ...
Coflow is a network abstraction used to represent communication patterns in data centers. The coflow...