The proliferation of observational devices and sensors with networking capabilities has led to growth in both the rates and sources of data that ultimately contribute to extreme-scale data volumes. Datasets generated in such settings are often multidimensional, with each dimension accounting for a feature of interest. We posit that efficient evaluation of queries over such datasets must account for both the distribution of data values and the patterns in the queries themselves. Configuring query evaluation by hand is infeasible given the data volumes, dimensionality, and the rates at which new data and queries arrive. In this paper, we describe our algorithm to autonomously improve query evaluations over voluminous, distributed datasets...
This paper addresses optimizing the execution of range queries into multi-dimensional datasets on d...
Peer-to-peer networks are becoming a common form of online data exchange. Querying data, mostly fil...
To meet today's data management needs, it is a widespread practice to use distributed data storage a...
Abstract—Networked observational devices and remote sensing equipment continue to proliferate and co...
Abstract—The quantity and precision of geospatial and time series observational data being collected...
Abstract—Efficient access to voluminous multidimensional datasets is essential for several scientifi...
Abstract—Data volumes in the geosciences and related domains have grown significantly as sensing equ...
Spatial data storage stresses the capability of conventional DBMSs. We present a scalable distribute...
This work introduces decentralized query processing techniques based on MIDAS, a novel distributed m...
Scientific datasets are often stored on distributed archival storage systems, because geographically...
Exploring and analyzing large volumes of data plays an increasingly important role in many domains o...
Abstract—We describe the design of a high-throughput storage system, Galileo, for data streams gener...
Distributed query processing is of paramount importance in next-generation distribution services, su...
Exploring and analyzing large volumes of data plays an increasingly important role in many domains o...
BigData revolutionised the IT industry. It first interested the OLTP systems. Distributed Hash Table...