This work presents three new adaptive optimization techniques to maximize the performance of dispel4py workflows. dispel4py is a parallel Python-based stream-oriented dataflow framework that acts as a bridge to existing parallel programming frameworks like MPI or Python multiprocessing. When a user runs a dispel4py workflow, the original framework performs a fixed workload distribution among the processes available for the run. This allocation does not take into account the features of the workflows, which can cause scalability issues, especially for data-intensive scientific workflows. Our aim, therefore, is to improve the performance of dispel4py workflows by testing different workload strategies that automatically adapt to workflows at r...
Thesis (Ph.D.), Department of Computer Science, Washington State UniversityLarge computing systems i...
Big data processing applications are becoming more and more complex. They are no more monolithic in ...
As more and more data can be generated at a fasterthan- ever rate nowadays, it becomes a challenge t...
Funding: This work is partially supported by the EU H2020 project DARE, No. 777413; and by Google Cl...
Abstract—This work presents three new adaptive optimizationtechniques to maximize the performance of...
This paper presents dispel4py, a new Python framework for describing abstract stream-based workflows...
This paper presents dispel4py, a new Python framework for describing abstract stream-based workflows...
Scientific workflows bridge scientific challenges with computational resources. While dispel4py, a s...
We present dispel4py, a novel data intensive and high performance computing middleware provided as a...
With the advancement in science and technology numerous complex scientific applications can be exec...
Large-scale data-intensive streaming applications in various science fields feature complex DAG-stru...
There is emerging interest in many scientific disciplines to deal with “dynamic” data, arising from ...
Workflows are widely used in applications that require coordinated use of computational resources. W...
Today’s data bonanza and increasing computational power provide many new opportunities for combining...
Workflows are widely used in applications that require coordinated use of computational resources. W...
Thesis (Ph.D.), Department of Computer Science, Washington State UniversityLarge computing systems i...
Big data processing applications are becoming more and more complex. They are no more monolithic in ...
As more and more data can be generated at a fasterthan- ever rate nowadays, it becomes a challenge t...
Funding: This work is partially supported by the EU H2020 project DARE, No. 777413; and by Google Cl...
Abstract—This work presents three new adaptive optimizationtechniques to maximize the performance of...
This paper presents dispel4py, a new Python framework for describing abstract stream-based workflows...
This paper presents dispel4py, a new Python framework for describing abstract stream-based workflows...
Scientific workflows bridge scientific challenges with computational resources. While dispel4py, a s...
We present dispel4py, a novel data intensive and high performance computing middleware provided as a...
With the advancement in science and technology numerous complex scientific applications can be exec...
Large-scale data-intensive streaming applications in various science fields feature complex DAG-stru...
There is emerging interest in many scientific disciplines to deal with “dynamic” data, arising from ...
Workflows are widely used in applications that require coordinated use of computational resources. W...
Today’s data bonanza and increasing computational power provide many new opportunities for combining...
Workflows are widely used in applications that require coordinated use of computational resources. W...
Thesis (Ph.D.), Department of Computer Science, Washington State UniversityLarge computing systems i...
Big data processing applications are becoming more and more complex. They are no more monolithic in ...
As more and more data can be generated at a fasterthan- ever rate nowadays, it becomes a challenge t...