© 2018 IEEE. Many datacenters usually process complex jobs such as MapReduce jobs. From a network perspective, most of these jobs trigger multiple parallel data flows, which comprise a coflow group semantically. When to schedule the jobs in datacenter or across multiple datacenters, most of current job schedulers have not considered the underlying network traffic load, which is suboptimal for jobs completion times. We present a new deadline-aware coflow scheduling approach called DCS, which takes the underlying network traffic load into consideration while guaranteeing high percentage of coflows that meet their deadlines. DCS aims to alleviate the network congestion in datacenters whose network worload are unbalanced, and it includes two st...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
MapReduce can speed up the execution of jobs operating over big data. A MapReduce job can be divided...
Due to its advantages of cost-effectiveness, on-demand provisioning and easy for sharing, cloud comp...
International audienceDatacenter networks routinely support the data transfers of distributed comput...
In current data centers, an application (e.g., MapReduce, Dryad, search platform, etc.) usually gene...
Datacenter networks routinely support the data transfers of distributed computing frameworks in the ...
Over the past decade, the confluence of an unprecedented growth in data volumes and the rapid rise o...
Communication in data-parallel applications often involves a col-lection of parallel flows. Traditio...
Abstract — In the data flow models of today’s data center applications such as MapReduce, Spark and ...
Emerging distributed applications, such as big data analytics, generate a large number of flows that...
Data parallel applications in data centers generate, process, and store huge volumes of data. Coflow...
Thanks to the exponential growth of data that needs to be processed in cloud datacenters, data paral...
Abstract—In the data flow models of today’s data center applications such as MapReduce, Spark and Dr...
To reduce the impact of network congestion on big data jobs, cluster management frameworks use vario...
Coflow is a recently proposed network abstraction to capture communication patterns in data centers....
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
MapReduce can speed up the execution of jobs operating over big data. A MapReduce job can be divided...
Due to its advantages of cost-effectiveness, on-demand provisioning and easy for sharing, cloud comp...
International audienceDatacenter networks routinely support the data transfers of distributed comput...
In current data centers, an application (e.g., MapReduce, Dryad, search platform, etc.) usually gene...
Datacenter networks routinely support the data transfers of distributed computing frameworks in the ...
Over the past decade, the confluence of an unprecedented growth in data volumes and the rapid rise o...
Communication in data-parallel applications often involves a col-lection of parallel flows. Traditio...
Abstract — In the data flow models of today’s data center applications such as MapReduce, Spark and ...
Emerging distributed applications, such as big data analytics, generate a large number of flows that...
Data parallel applications in data centers generate, process, and store huge volumes of data. Coflow...
Thanks to the exponential growth of data that needs to be processed in cloud datacenters, data paral...
Abstract—In the data flow models of today’s data center applications such as MapReduce, Spark and Dr...
To reduce the impact of network congestion on big data jobs, cluster management frameworks use vario...
Coflow is a recently proposed network abstraction to capture communication patterns in data centers....
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
MapReduce can speed up the execution of jobs operating over big data. A MapReduce job can be divided...
Due to its advantages of cost-effectiveness, on-demand provisioning and easy for sharing, cloud comp...