Distributed Stream Processing Systems (DSPS) are ``Fast Data'' platforms that allow streaming applications to be composed and executed with low latency on commodity clusters and Clouds. Such applications are composed as a Directed Acyclic Graph (DAG) of tasks, with data parallel execution using concurrent task threads on distributed resource slots. Scheduling such DAGs for DSPS has two parts-allocation of threads and resources for a DAG, and mapping threads to resources. Existing schedulers often address just one of these, make the assumption that performance linearly scales, or use ad hoc empirical tuning at runtime. Instead, we propose model-driven techniques for both mapping and allocation that rely on low-overhead a priori performance m...
Grids enable sharing, selection and aggregation of geographically distributed resources among variou...
This paper proposes a learned cost estimation model for Distributed Stream Processing Systems (DSPS)...
Abstract. This paper describes the SODA scheduler for System S, a highly scalable distributed stream...
The velocity dimension of Big Data refers to the need to rapidly process data that arrives continuou...
In the era of big data, with streaming applications such as social media, surveillance monitoring an...
The era of big data has led to the emergence of new systems for real-time distributed stream process...
ii The era of big data has led to the emergence of new systems for real-time distributed stream proc...
In this study, we investigated the problem of scheduling streaming applications on a heterogeneous c...
In the most popular distributed stream processing frameworks (DSPFs), programs are modeled as a dire...
With ever increasing data volumes, large compute clusters that process data in a distributed manner ...
Task scheduling in distributed stream computing systems is an NP-complete problem. Current schedulin...
Fog computing is rapidly changing the distributed computing landscape by extending the Cloud computi...
General-purpose Distributed Stream Data Processing Systems (DSDPSs) have attracted extensi...
This paper describes the SODA scheduler for System S, a highly scalable distributed stream processin...
Data Stream Processing (DSP) applications are widely used to timely extract information from distrib...
Grids enable sharing, selection and aggregation of geographically distributed resources among variou...
This paper proposes a learned cost estimation model for Distributed Stream Processing Systems (DSPS)...
Abstract. This paper describes the SODA scheduler for System S, a highly scalable distributed stream...
The velocity dimension of Big Data refers to the need to rapidly process data that arrives continuou...
In the era of big data, with streaming applications such as social media, surveillance monitoring an...
The era of big data has led to the emergence of new systems for real-time distributed stream process...
ii The era of big data has led to the emergence of new systems for real-time distributed stream proc...
In this study, we investigated the problem of scheduling streaming applications on a heterogeneous c...
In the most popular distributed stream processing frameworks (DSPFs), programs are modeled as a dire...
With ever increasing data volumes, large compute clusters that process data in a distributed manner ...
Task scheduling in distributed stream computing systems is an NP-complete problem. Current schedulin...
Fog computing is rapidly changing the distributed computing landscape by extending the Cloud computi...
General-purpose Distributed Stream Data Processing Systems (DSDPSs) have attracted extensi...
This paper describes the SODA scheduler for System S, a highly scalable distributed stream processin...
Data Stream Processing (DSP) applications are widely used to timely extract information from distrib...
Grids enable sharing, selection and aggregation of geographically distributed resources among variou...
This paper proposes a learned cost estimation model for Distributed Stream Processing Systems (DSPS)...
Abstract. This paper describes the SODA scheduler for System S, a highly scalable distributed stream...