This paper is concerned with the efficient execution of multiple query workloads on a cluster of SMPs. We target applications that access and manipulate large scientific datasets. Queries in these applications involve user-defined processing operations on data and distributed data structures to hold intermediate and final results. Our goal is to implement system components to leverage previously computed query results and to effectively utilize processing power and aggregated I/O bandwidth on SMP nodes so that both single queries and multi-query batches can be efficiently executed. (Also referenced as UMIACS-TR-2001-78
This work addresses the problem of sharing execution plans for queries that continuously cluster str...
Database systems allow for concurrent use of several applications (and query interfaces). Each appli...
The rapid increase in the data volumes encountered in many application domains has led to widespread...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SMP...
Applications that analyze, mine, and visualize large datasets is considered an important class of a...
In modern large-scale distributed systems, analytics jobs submitted by various users often share sim...
This work investigates the leverage that can be obtained from compiler optimization techniques for ...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
MQO is a distributed multiple query processing middleware that can optimize query processing for da...
Data analysis applications such as Kronos, a remote sensing application, and the Virtual Microscope,...
Abstract—This paper proposes a strategy to organize metric-space query processing in multi-core sear...
Query scheduling plays an important role when systems are faced with limited resources and high wor...
This paper proposes a complementary novel idea, called MiniTasking to further reduce the number of c...
The upcoming generation of computer hardware poses several new challenges for database developers an...
Queries with common sequences of disk accesses can make maximal use of a buffer pool. We developed a...
This work addresses the problem of sharing execution plans for queries that continuously cluster str...
Database systems allow for concurrent use of several applications (and query interfaces). Each appli...
The rapid increase in the data volumes encountered in many application domains has led to widespread...
This paper is concerned with the efficient execution of multiple query workloads on a cluster of SMP...
Applications that analyze, mine, and visualize large datasets is considered an important class of a...
In modern large-scale distributed systems, analytics jobs submitted by various users often share sim...
This work investigates the leverage that can be obtained from compiler optimization techniques for ...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
MQO is a distributed multiple query processing middleware that can optimize query processing for da...
Data analysis applications such as Kronos, a remote sensing application, and the Virtual Microscope,...
Abstract—This paper proposes a strategy to organize metric-space query processing in multi-core sear...
Query scheduling plays an important role when systems are faced with limited resources and high wor...
This paper proposes a complementary novel idea, called MiniTasking to further reduce the number of c...
The upcoming generation of computer hardware poses several new challenges for database developers an...
Queries with common sequences of disk accesses can make maximal use of a buffer pool. We developed a...
This work addresses the problem of sharing execution plans for queries that continuously cluster str...
Database systems allow for concurrent use of several applications (and query interfaces). Each appli...
The rapid increase in the data volumes encountered in many application domains has led to widespread...