Current systems for data-parallel, incremental processing and view maintenance over high-rate streams isolate the execution of independent queries. This creates unwanted redundancy and overhead in the presence of concurrent incrementally maintained queries: each query must independently maintain the same indexed state over the same input streams, and new queries must build this state from scratch before they can begin to emit their first results. This paper introduces shared arrangements: indexed views of maintained state that allow concurrent queries to reuse the same in-memory state without compromising data-parallel performance and scaling. We implement shared arrangements in a modern stream processor and show order-of-magnitude impro...
In the quest for valuable information, modern big data applications continuously monitor streams of ...
Today, an ever-increasing amount of data is collected and analyzed by researchers, businesses, and sc...
At the moment we are witnessing the maturation of distributed streaming dataflow systems whose use-c...
The processing of data streams plays a central role in emerging applications such as pervasive compu...
In relational database management systems, views supplement basic query constructs to cope with the ...
Data streaming has become an important paradigm for the real-time processing of continuous ...
Distributed Data Stream Management Systems (DSMS) are increasingly used for the processing of high-r...
Continuous query processing has emerged as a promising query processing paradigm with numerous appli...
Today’s data deluge enables organizations to collect massive data, and analyze it with an ever-incre...
Numerous applications in, for example, science, engineering, and financial analysis increasingly requi...
Present-day computing systems have to deal with a continuous growth of data rate and volume. Process...
To learn from a large dataset, we generally want to perform lots of queries. If we perform each quer...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
Thesis (Ph.D.), University of Washington, 2021. As the demand for data-intensive pipelines has grown a...
In many data gathering applications, information arrives in the form of continuous streams rather th...