Compute clusters, consisting of many, uniformly built nodes, are used to run a large spectrum of different workloads, like tightly coupled (MPI) jobs, MapReduce, or graph-processing data-analytics applications, each of which with their own resource requirements. Many studies consistently highlight two types of under-utilized cluster resources: memory (up to 50%) and network. In this work, we take a step towards (software) resource disaggregation, and therefore increased resource utilization, by designing a memory scavenging technique that makes unused memory available to applications on other cluster nodes. We implement this technique in MemFSS, an inmemory distributed file system. The scavenging MemFSS extends its storage space by taking a...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Tightly-coupled HPC systems have rigid memory allocation and can result in expensive memory resource...
Software-coherent, distributed shared memory has received conciderable amount of attention as an att...
In many-task computing (MTC), applications such as scientific workflows or parameter sweeps communic...
In many-task computing (MTC), applications such as scientific workflows or parameter sweeps communic...
Abstract—Data-intensive scientific workflows are composed of many tasks that exhibit data precedence...
Amid a data revolution that is transforming industries around the globe, computing systems have unde...
Abstract—Current commercial solutions intended to provide additional resources to an application bei...
Improvements in parallel computing hardware usually involve increments in the number of available re...
Part 4: Memory System DesignInternational audienceIn the last decades, the increasing amount of reso...
Main memory in clusters may dominate total system power. The resulting energy consumption increases ...
In a cluster system with dynamic load sharing support, a job submission or migration to a workstatio...
Abstract—Data-intensive scientific workflows exhibit inter-task dependencies that generate file-base...
Main memory in clusters may dominate total system power. The resulting energy consumption increases ...
Large scientific parallel applications demand large amounts of memory space. Current parallel comput...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Tightly-coupled HPC systems have rigid memory allocation and can result in expensive memory resource...
Software-coherent, distributed shared memory has received conciderable amount of attention as an att...
In many-task computing (MTC), applications such as scientific workflows or parameter sweeps communic...
In many-task computing (MTC), applications such as scientific workflows or parameter sweeps communic...
Abstract—Data-intensive scientific workflows are composed of many tasks that exhibit data precedence...
Amid a data revolution that is transforming industries around the globe, computing systems have unde...
Abstract—Current commercial solutions intended to provide additional resources to an application bei...
Improvements in parallel computing hardware usually involve increments in the number of available re...
Part 4: Memory System DesignInternational audienceIn the last decades, the increasing amount of reso...
Main memory in clusters may dominate total system power. The resulting energy consumption increases ...
In a cluster system with dynamic load sharing support, a job submission or migration to a workstatio...
Abstract—Data-intensive scientific workflows exhibit inter-task dependencies that generate file-base...
Main memory in clusters may dominate total system power. The resulting energy consumption increases ...
Large scientific parallel applications demand large amounts of memory space. Current parallel comput...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
Tightly-coupled HPC systems have rigid memory allocation and can result in expensive memory resource...
Software-coherent, distributed shared memory has received conciderable amount of attention as an att...