Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF. These storage formats are of particular interest to the scientific user community since they provide multi-dimensional storage and retrieval. However, one of the drawbacks of these storage formats is that they do not support semantic indexing which is important for interactive data analysis where scientists look for features of interests such as "Find all supernova explosions where energy > 105 and temperature > 106". In this paper we present a novel approach called HDF5-FastQuery to accelerate the data access of large HDF5 files by introducing multi-dimensional semantic indexing. Our implementation leverages an efficient indexi...
FastQuery is a parallel indexing and querying system we developed for accelerating analysis and visu...
In this paper, we describe a strategy of using compressed bitmap indices to speed up queries on bot...
Applications that query into very large multidimensional datasets are becoming more common. Many sel...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
This work focuses on research and development activities that bridge a gap between fundamental data...
This work focuses on research and development activities that bridge a gap between fundamental data ...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the- a...
Scientific experiments and large-scale simulations produce massive amounts of data. Many of these sc...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
The ability to extract information from collected data has always driven science. Today.s large comp...
Abstract—Scientific experiments and simulations produce mountains of data in file formats, such as H...
Applications that query into very large multidimensional datasets are becoming more common. Many sel...
Applications that query into very large multidimensional datasets are becoming more common. Many sel...
FastQuery is a parallel indexing and querying system we developed for accelerating analysis and visu...
In this paper, we describe a strategy of using compressed bitmap indices to speed up queries on bot...
Applications that query into very large multidimensional datasets are becoming more common. Many sel...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
This work focuses on research and development activities that bridge a gap between fundamental data...
This work focuses on research and development activities that bridge a gap between fundamental data ...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the- a...
Scientific experiments and large-scale simulations produce massive amounts of data. Many of these sc...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
The ability to extract information from collected data has always driven science. Today.s large comp...
Abstract—Scientific experiments and simulations produce mountains of data in file formats, such as H...
Applications that query into very large multidimensional datasets are becoming more common. Many sel...
Applications that query into very large multidimensional datasets are becoming more common. Many sel...
FastQuery is a parallel indexing and querying system we developed for accelerating analysis and visu...
In this paper, we describe a strategy of using compressed bitmap indices to speed up queries on bot...
Applications that query into very large multidimensional datasets are becoming more common. Many sel...