Due to high data volumes and unpredictable arrival rates, continuous query systems processing expensive queries in real time may fail to keep up with the input data streams, resulting in buffer overflow and uncontrolled loss of data. In this work, we explore join direction adaptation (JDA) to tackle resource-limited processing of multi-join stream queries. While existing JDA solutions allocate the scarce CPU resources to the most productive half-way join within a single operator, we instead leverage the operator interdependencies to optimize the overall query throughput. We identify result staleness as an impending issue in resource-limited processing, one that is further aggravated when throughput-optimizing techniques are employed. For ...
One problem encountered in real-time data integration is the join of a continuous incoming data stre...
Efficient resource optimization is critical to manage the velocity and volume of real-time streaming...
Applications that involve data integration among multiple sources often require a preliminary step o...
Abstract. Due to high data volumes and unpredictable arrival rates, continuous query systems process...
Continuous query processing in data stream management systems (DSMS) has recei...
Thesis (Ph.D.)--University of Washington, 2021. As the demand for data intensive pipelines has grown a...
Continuous queries process real-time streaming data and output results in streams for a wide range o...
The join operation combines information from multiple data sources. Efficient processing of join que...
This paper addresses the problem of computing approximate answers to continuou...
Recent years have witnessed the growth of a new class of data-intensive application...
Conventional data warehouses employ the query-at-a-time model, which maps each query to a distinct p...
Semi-stream join algorithms join a fast stream input with a disk-based master data relation. A commo...
Recent years have witnessed a rapid rise of a new class of data-intensive applications in which data...
Join optimization is one of the most challenging tasks in query processing. The performance of join...