Communication in data-parallel applications often involves a collection of parallel flows. Traditional techniques to optimize flow-level metrics do not perform well in optimizing such collections, because the network is largely agnostic to application-level requirements. The recently proposed coflow abstraction bridges this gap and creates new opportunities for network scheduling. In this paper, we address inter-coflow scheduling for two different objectives: decreasing communication time of data-intensive jobs and guaranteeing predictable communication time. We introduce the concurrent open shop scheduling with coupled resources problem, analyze its complexity, and propose effective heuristics to optimize either objective. We present ...
Coflow is a network abstraction used to represent communication patterns in data centers. The coflow...
We consider the matroid coflow scheduling problem, where each job is comprised of a set of flows and...
This electronic version was submitted by the student author. The certified thesis is available in th...
Over the past decade, the confluence of an unprecedented growth in data volumes and the rapid rise o...
In current data centers, an application (e.g., MapReduce, Dryad, search platform, etc.) usually gene...
Coflow is a recently proposed network abstraction to capture communication patterns in data centers....
© 2018 IEEE. Many datacenters usually process complex jobs such as MapReduce jobs. From a network pe...
In the data flow models of today’s data center applications such as MapReduce, Spark and ...
Data parallel applications in data centers generate, process, and store huge volumes of data. Coflow...
Emerging distributed applications, such as big data analytics, generate a large number of flows that...
In the data flow models of today’s data center applications such as MapReduce, Spark and Dr...
Thanks to the exponential growth of data that needs to be processed in cloud datacenters, data paral...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...
Datacenter networks routinely support the data transfers of distributed comput...
Efficient execution of distributed database operators such as joining and aggregating is critical fo...