Distributed computing technologies, as popularized by Hadoop, have been proliferating in Cloud and enterprise computing over ten years, with the capability of processing data across thousands of machines. There is a wide diversity of workloads in such large scale clusters shared by multi-Tenant. Hence, resource utilization and task scheduling become vital to performance and bring challenges to architecture designers. We present Daphne, a hybrid scheduling framework that strikes a tradeoff among three universal scheduling frameworks: 1) centralized scheduling; 2) loose coordination scheduling; and 3) fully distributed scheduling. Daphne defines a matching tree that forwards the application to the fittest scheduler according to the characteri...
The MapReduce framework has become the de facto scheme for scalable semi-structured and un-structure...
Abstract. Recent success in building petascale computing systems poses new challenges in job schedul...
International audienceThis article presents a two-level strategy for scheduling large workloads of p...
Scheduling in large scale computing clusters is critical to job performance and resource utilization...
The MapReduce framework has become the defacto scheme for scalable semi-structured and un-structured...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
International audienceThis paper addresses the problem of efficient scheduling of large clusters und...
A long-standing challenge in cluster scheduling is to achieve a high degree of utilization of hetero...
The exponential growth of collected data poses the challenge of efficient data processing among othe...
Distributed data-parallel processing systems like MapReduce, Spark, and Flink are popular for analyz...
To reduce the impact of network congestion on big data jobs, cluster management frameworks use vario...
Clusters of commodity microprocessors have overtaken custom-designed systems as the high performance...
Cluster schedulers provide flexible resource sharing mechanism for best-effort cloud jobs, which occ...
(c) 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for...
Abstract This paper addresses the problem of efficient scheduling of large clusters under high load ...
The MapReduce framework has become the de facto scheme for scalable semi-structured and un-structure...
Abstract. Recent success in building petascale computing systems poses new challenges in job schedul...
International audienceThis article presents a two-level strategy for scheduling large workloads of p...
Scheduling in large scale computing clusters is critical to job performance and resource utilization...
The MapReduce framework has become the defacto scheme for scalable semi-structured and un-structured...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
International audienceThis paper addresses the problem of efficient scheduling of large clusters und...
A long-standing challenge in cluster scheduling is to achieve a high degree of utilization of hetero...
The exponential growth of collected data poses the challenge of efficient data processing among othe...
Distributed data-parallel processing systems like MapReduce, Spark, and Flink are popular for analyz...
To reduce the impact of network congestion on big data jobs, cluster management frameworks use vario...
Clusters of commodity microprocessors have overtaken custom-designed systems as the high performance...
Cluster schedulers provide flexible resource sharing mechanism for best-effort cloud jobs, which occ...
(c) 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for...
Abstract This paper addresses the problem of efficient scheduling of large clusters under high load ...
The MapReduce framework has become the de facto scheme for scalable semi-structured and un-structure...
Abstract. Recent success in building petascale computing systems poses new challenges in job schedul...
International audienceThis article presents a two-level strategy for scheduling large workloads of p...