Distributed stream processing frameworks are designed to perform continuous computation on possibly unbounded data streams whose rates can change over time. Devising solutions to make such systems elastically scale is a fundamental goal to achieve desired performance and cut costs caused by resource over-provisioning. These systems can be scaled along two dimensions: the operator parallelism and the number of resources. In this paper, we show how these two dimensions, as two symbiotic entities, are independent but must mutually interact for the global benefit of the system. On the basis of this observation, we propose a fine-grained model for estimating the resource utilization of a stream processing application that enables the independent...
Data Stream Processing (DSP) applications should be capable to efficiently process high-velocity con...
Big data is characterized by volume and velocity [24], and recently several real- time stream proces...
An increasing number of data-driven applications rely on the ability of processing data flows in a t...
Distributed stream processing frameworks are designed to perform continuous computation on possibly ...
We describe an approach to elastically scale the per-formance of a data analytics operator that is p...
Stream Processing was recently introduced as a paradigm to easily develop and deploy applications ta...
International audienceThis paper investigates reactive elasticity in stream processing environments ...
This article addresses the profitability problem associated with auto-parallelization of general-pur...
Data stream processing systems are used to process data from high velocity data sources like financi...
ii The era of big data has led to the emergence of new systems for real-time distributed stream proc...
International audienceNowadays, more and more sources (connected devices, social networks, etc.) emi...
More and more use cases require fast, accurate, and reliable processing of large volumes of data. To...
Stream processing applications extract value from raw data through Directed Acyclic Graphs of data a...
Data Stream Processing (DSP) applications should be capable to efficiently process high-velocity con...
Big data is characterized by volume and velocity [24], and recently several real- time stream proces...
An increasing number of data-driven applications rely on the ability of processing data flows in a t...
Distributed stream processing frameworks are designed to perform continuous computation on possibly ...
We describe an approach to elastically scale the per-formance of a data analytics operator that is p...
Stream Processing was recently introduced as a paradigm to easily develop and deploy applications ta...
International audienceThis paper investigates reactive elasticity in stream processing environments ...
This article addresses the profitability problem associated with auto-parallelization of general-pur...
Data stream processing systems are used to process data from high velocity data sources like financi...
ii The era of big data has led to the emergence of new systems for real-time distributed stream proc...
International audienceNowadays, more and more sources (connected devices, social networks, etc.) emi...
More and more use cases require fast, accurate, and reliable processing of large volumes of data. To...
Stream processing applications extract value from raw data through Directed Acyclic Graphs of data a...
Data Stream Processing (DSP) applications should be capable to efficiently process high-velocity con...
Big data is characterized by volume and velocity [24], and recently several real- time stream proces...
An increasing number of data-driven applications rely on the ability of processing data flows in a t...