This paper is concerned with the efficient execution of multiple query workloads on a cluster of SMPs. We tar-get applications that access and manipulate large scientific datasets. Queries in these applications involve user-defined processing operations and distributed data structures to hold intermediate and final results. Our goal is to implement system components to leverage previously computed query results and to effectively utilize processing power and ag-gregated I/O bandwidth on SMP nodes so that both single queries and multi-query batches can be efficiently executed.
Data analysis applications such as Kronos, a remote sensing application, and the Virtual Microscope,...
In modern large-scale distributed systems, analytics jobs submitted by various users often share sim...
The widespread dissemination of small-scale sensor nodes has sparked interest in a powerful new dat...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SMP...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SM...
Applications that analyze, mine, and visualize large datasets is considered an important class of a...
This work investigates the leverage that can be obtained from compiler optimization techniques for e...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
Data analysis applications in areas as diverse as remote sensing and telepathology require operating...
[[abstract]]Mermaid is a testbed system which provides integrated access to multiple databases. Two ...
International audienceMapReduce model is a new parallel programming model initially developed for la...
The diversity and large volumes of data processed in the Natural Sciences today has led to a prolife...
Abstract—This paper proposes a strategy to organize metric-space query processing in multi-core sear...
This work addresses the problem of sharing execution plans for queries that continuously cluster str...
Some recently proposed extensions to relational database systems as well as deductive database syste...
Data analysis applications such as Kronos, a remote sensing application, and the Virtual Microscope,...
In modern large-scale distributed systems, analytics jobs submitted by various users often share sim...
The widespread dissemination of small-scale sensor nodes has sparked interest in a powerful new dat...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SMP...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SM...
Applications that analyze, mine, and visualize large datasets is considered an important class of a...
This work investigates the leverage that can be obtained from compiler optimization techniques for e...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
Data analysis applications in areas as diverse as remote sensing and telepathology require operating...
[[abstract]]Mermaid is a testbed system which provides integrated access to multiple databases. Two ...
International audienceMapReduce model is a new parallel programming model initially developed for la...
The diversity and large volumes of data processed in the Natural Sciences today has led to a prolife...
Abstract—This paper proposes a strategy to organize metric-space query processing in multi-core sear...
This work addresses the problem of sharing execution plans for queries that continuously cluster str...
Some recently proposed extensions to relational database systems as well as deductive database syste...
Data analysis applications such as Kronos, a remote sensing application, and the Virtual Microscope,...
In modern large-scale distributed systems, analytics jobs submitted by various users often share sim...
The widespread dissemination of small-scale sensor nodes has sparked interest in a powerful new dat...