This paper examines the feasibility of dynamic rescheduling techniques for effectively utilizing compute resources within a data center. Our work is motivated by practical concerns of Intel’s NetBatch system, an Internet-scale data center based distributed computing platform developed by Intel Corporation for massively parallel chip simulations within the company. NetBatch has been operational for many years, and currently is deployed live on tens of thousands of machines that are globally distributed at various data centers. We perform an analysis of job execution traces obtained over a one year period collected from tens of thousands of NetBatch machines from 20 different pools. Our analysis show that we observe that the NetBatch currentl...
As the demand for high-performance computing (HPC) resources has increased in the field of computati...
In a cluster system with dynamic load sharing support, a job submission or migration to a workstatio...
International audienceEffectively mapping tasks of High Performance Computing (HPC) applications on ...
This paper examines the feasibility of dynamic rescheduling techniques for effectively utilizing com...
We perform a trace-driven analysis of the Intel Distributed Computing Platform (IDCP), an Internet-s...
AbstractScheduling is a key component for performance guarantees in the case of distributed applicat...
A major performance issue in large-scale decentralized distributed systems, such as grids, is how to...
Modern high-performance computing (HPC) system designs have converged to heavyweight nodes with grow...
Real-time resource scheduling is an important factor for improving the performance of cluster comput...
In the past few years, we have envisioned an increasing number of businesses start driving by big da...
Motived by frequent failures in cloud computing systems, we aim to demystify the underlying reschedu...
Purpose: Although proactive fault handling plans are widely spread, many unexpected data center outa...
The recent trend towards more programmable switching hardware in data centers opens up new possibili...
Highly dynamic environments like clouds by nature cause a high degree of unpredictability of resourc...
The recent trend towards more programmable switching hardware in data centers opens up new possibili...
As the demand for high-performance computing (HPC) resources has increased in the field of computati...
In a cluster system with dynamic load sharing support, a job submission or migration to a workstatio...
International audienceEffectively mapping tasks of High Performance Computing (HPC) applications on ...
This paper examines the feasibility of dynamic rescheduling techniques for effectively utilizing com...
We perform a trace-driven analysis of the Intel Distributed Computing Platform (IDCP), an Internet-s...
AbstractScheduling is a key component for performance guarantees in the case of distributed applicat...
A major performance issue in large-scale decentralized distributed systems, such as grids, is how to...
Modern high-performance computing (HPC) system designs have converged to heavyweight nodes with grow...
Real-time resource scheduling is an important factor for improving the performance of cluster comput...
In the past few years, we have envisioned an increasing number of businesses start driving by big da...
Motived by frequent failures in cloud computing systems, we aim to demystify the underlying reschedu...
Purpose: Although proactive fault handling plans are widely spread, many unexpected data center outa...
The recent trend towards more programmable switching hardware in data centers opens up new possibili...
Highly dynamic environments like clouds by nature cause a high degree of unpredictability of resourc...
The recent trend towards more programmable switching hardware in data centers opens up new possibili...
As the demand for high-performance computing (HPC) resources has increased in the field of computati...
In a cluster system with dynamic load sharing support, a job submission or migration to a workstatio...
International audienceEffectively mapping tasks of High Performance Computing (HPC) applications on ...