Abstract—Data-intensive scientific workflows are composed of many tasks that exhibit data precedence constraints leading to communication schemes expressed by means of intermediate files. In such scenarios, the storage layer is often a bottleneck, limiting overall application scalability, due to large volumes of data being generated during runtime at high I/O rates. To alleviate the storage pressure, applications take advantage of in-memory runtime distributed file systems that act as a fast, distributed cache, which greatly enhances I/O performance. In this paper, we present scalability results for MemFS, a distributed in-memory runtime file system. MemFS takes an opposite approach to data locality, by scattering all data among the nodes, ...
The ever-increasing power of supercomputer systems is both driving and enabling the emergence of new...
In this paper we present the behavior of PAFS in a scientific environment where big parallel applica...
The adoption of low latency persistent memory modules (PMMs) upends the long-established model of re...
In many-task computing (MTC), applications such as scientific workflows or parameter sweeps communic...
In many-task computing (MTC), applications such as scientific workflows or parameter sweeps communic...
Abstract—Data-intensive scientific workflows exhibit inter-task dependencies that generate file-base...
Scientific domains such as astronomy or bioinformatics produce increasingly large amounts of data th...
Compute clusters, consisting of many, uniformly built nodes, are used to run a large spectrum of dif...
Many scientific computations can be expressed as Many-Task Computing (MTC) applications. In such sce...
Traditional cloud computing technologies, such as MapReduce, use file systems as the system-wide sub...
The decades-old concepts and assumptions behind traditional file system design have been rendered pa...
Generalizable approaches, models, and frameworks for irregular application scalability is an old yet...
Abstract Cloud computing applications require a scalable, elastic and fault tol-erant storage system...
Parallel scientific applications require high-performance I/O support from underlying file systems. ...
The advent of wide-area high-speed networks provides the framework for deploying large scale applica...
The ever-increasing power of supercomputer systems is both driving and enabling the emergence of new...
In this paper we present the behavior of PAFS in a scientific environment where big parallel applica...
The adoption of low latency persistent memory modules (PMMs) upends the long-established model of re...
In many-task computing (MTC), applications such as scientific workflows or parameter sweeps communic...
In many-task computing (MTC), applications such as scientific workflows or parameter sweeps communic...
Abstract—Data-intensive scientific workflows exhibit inter-task dependencies that generate file-base...
Scientific domains such as astronomy or bioinformatics produce increasingly large amounts of data th...
Compute clusters, consisting of many, uniformly built nodes, are used to run a large spectrum of dif...
Many scientific computations can be expressed as Many-Task Computing (MTC) applications. In such sce...
Traditional cloud computing technologies, such as MapReduce, use file systems as the system-wide sub...
The decades-old concepts and assumptions behind traditional file system design have been rendered pa...
Generalizable approaches, models, and frameworks for irregular application scalability is an old yet...
Abstract Cloud computing applications require a scalable, elastic and fault tol-erant storage system...
Parallel scientific applications require high-performance I/O support from underlying file systems. ...
The advent of wide-area high-speed networks provides the framework for deploying large scale applica...
The ever-increasing power of supercomputer systems is both driving and enabling the emergence of new...
In this paper we present the behavior of PAFS in a scientific environment where big parallel applica...
The adoption of low latency persistent memory modules (PMMs) upends the long-established model of re...