In this paper, we study partitioning functions for stream processing systems that employ stateful data parallelism to improve application throughput. In particular, we develop partitioning functions that are effective under workloads where the domain of the partitioning key is large and its value distribution is skewed. We define various desirable properties for partitioning functions, ranging from balance properties such as memory, processing, and communication balance, structural properties such as compactness and fast lookup, and adaptation properties such as fast computation and minimal migration. We introduce a partitioning function structure that is compact and develop several associated heuristic construction techniques that exhibit ...
More and more use cases require fast, accurate, and reliable processing of large volumes of data. To...
This thesis considers how to exploit the specific characteristics of data streaming functions and mu...
This paper presents a partitioning and allocation algorithm for an iterative stream compiler, target...
Cataloged from PDF version of article.In this paper we study partitioning functions for stream proc...
This article addresses the profitability problem associated with auto-parallelization of general-pur...
Key grouping is a technique used by stream processing frameworks to simplify the development of para...
Scalable execution of continuous queries over massive data streams often requires splitting input st...
Streaming applications process possibly infinite streams of data and often have both high throughput...
International audienceKey grouping is a technique used by stream processing frameworks to simplify t...
Stream based languages are a popular approach to expressing parallelism in modern applications. The ...
Key-based workload partitioning is now commonly used in parallel stream processing, enabling effecti...
Stream workloads vary widely, as do proposed stream ar-chitectures. We argue that stream processors ...
One of the problems proven to be NP-hard in the field of many-core architectures is the partitioning...
As more aspects of our daily lives are being computerized, ever larger amounts of data are being pro...
We study the problem of load balancing in distributed stream processing engines, which is exacerbate...
More and more use cases require fast, accurate, and reliable processing of large volumes of data. To...
This thesis considers how to exploit the specific characteristics of data streaming functions and mu...
This paper presents a partitioning and allocation algorithm for an iterative stream compiler, target...
Cataloged from PDF version of article.In this paper we study partitioning functions for stream proc...
This article addresses the profitability problem associated with auto-parallelization of general-pur...
Key grouping is a technique used by stream processing frameworks to simplify the development of para...
Scalable execution of continuous queries over massive data streams often requires splitting input st...
Streaming applications process possibly infinite streams of data and often have both high throughput...
International audienceKey grouping is a technique used by stream processing frameworks to simplify t...
Stream based languages are a popular approach to expressing parallelism in modern applications. The ...
Key-based workload partitioning is now commonly used in parallel stream processing, enabling effecti...
Stream workloads vary widely, as do proposed stream ar-chitectures. We argue that stream processors ...
One of the problems proven to be NP-hard in the field of many-core architectures is the partitioning...
As more aspects of our daily lives are being computerized, ever larger amounts of data are being pro...
We study the problem of load balancing in distributed stream processing engines, which is exacerbate...
More and more use cases require fast, accurate, and reliable processing of large volumes of data. To...
This thesis considers how to exploit the specific characteristics of data streaming functions and mu...
This paper presents a partitioning and allocation algorithm for an iterative stream compiler, target...