Funding: This work is partially supported by the EU H2020 project DARE, No. 777413; and by Google Cloud Platform research credits program.This work presents three new adaptive optimization techniques to maximize the performance of dispel4py workflows. dispel4py is a parallel Python-based stream-oriented dataflow framework that acts as a bridge to existing parallel programming frameworks like MPI or Python multiprocessing. When a user runs a dispel4py workflow, the original framework performs a fixed workload distribution among the processes available for the run. This allocation does not take into account the features of the workflows, which can cause scalability issues, especially for data-intensive scientific workflows. Our aim, therefore...
Big data processing applications are becoming more and more complex. They are no more monolithic in ...
International audienceThe recent rapid expansion of Cloud computing facilities triggers an attendant...
We present Asterism, an open source data-intensive framework, which combines the strengths of tradit...
This work presents three new adaptive optimization techniques to maximize the performance of dispel4...
Abstract—This work presents three new adaptive optimizationtechniques to maximize the performance of...
Scientific workflows bridge scientific challenges with computational resources. While dispel4py, a s...
This paper presents dispel4py, a new Python framework for describing abstract stream-based workflows...
This paper presents dispel4py, a new Python framework for describing abstract stream-based workflows...
With the advancement in science and technology numerous complex scientific applications can be exec...
We present dispel4py, a novel data intensive and high performance computing middleware provided as a...
The emergence of data-intensive science as the fourth science paradigm has posed a data deluge chal...
Large-scale data-intensive streaming applications in various science fields feature complex DAG-stru...
Workflow techniques have been widely used as a major computing solution in many science domains. Wit...
Workflows are widely used in applications that require coordinated use of computational resources. W...
International audienceStream processing systems (SPS) have to deal with highly dynamic scenarios whe...
Big data processing applications are becoming more and more complex. They are no more monolithic in ...
International audienceThe recent rapid expansion of Cloud computing facilities triggers an attendant...
We present Asterism, an open source data-intensive framework, which combines the strengths of tradit...
This work presents three new adaptive optimization techniques to maximize the performance of dispel4...
Abstract—This work presents three new adaptive optimizationtechniques to maximize the performance of...
Scientific workflows bridge scientific challenges with computational resources. While dispel4py, a s...
This paper presents dispel4py, a new Python framework for describing abstract stream-based workflows...
This paper presents dispel4py, a new Python framework for describing abstract stream-based workflows...
With the advancement in science and technology numerous complex scientific applications can be exec...
We present dispel4py, a novel data intensive and high performance computing middleware provided as a...
The emergence of data-intensive science as the fourth science paradigm has posed a data deluge chal...
Large-scale data-intensive streaming applications in various science fields feature complex DAG-stru...
Workflow techniques have been widely used as a major computing solution in many science domains. Wit...
Workflows are widely used in applications that require coordinated use of computational resources. W...
International audienceStream processing systems (SPS) have to deal with highly dynamic scenarios whe...
Big data processing applications are becoming more and more complex. They are no more monolithic in ...
International audienceThe recent rapid expansion of Cloud computing facilities triggers an attendant...
We present Asterism, an open source data-intensive framework, which combines the strengths of tradit...