Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF. These storage formats are of particular interest to the scientific user community since they provide multi-dimensional storage and retrieval. However, one of the drawbacks of these storage formats is that they do not support semantic indexing which is important for interactive data analysis where scientists look for features of interests such as ''Find all supernova explosions where energy >105 and temperature >106''. In this paper we present a novel approach called HDF5-FastQuery to accelerate the data access of large HDF5 files by introducing multi-dimensional semantic indexing. Our implementation leverages an efficient indexing tech...
Abstract. FastBit is a software tool for searching large read-only datasets. It organizes user data ...
In this chapter, we explore ways to answer queries on large multi-dimensional data efficiently. Giv...
FastBit is a software tool for searching large read-only datasets. It organizes user data in a colum...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF...
This work focuses on research and development activities that bridge a gap between fundamental data...
This work focuses on research and development activities that bridge a gap between fundamental data ...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the- a...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
Scientific experiments and large-scale simulations produce massive amounts of data. Many of these sc...
Abstract—Scientific experiments and simulations produce mountains of data in file formats, such as H...
The ability to extract information from collected data has always driven science. Today.s large comp...
FastQuery is a parallel indexing and querying system we developed for accelerating analysis and visu...
In this paper, we describe a strategy of using compressed bitmap indices to speed up queries on bot...
Abstract. FastBit is a software tool for searching large read-only datasets. It organizes user data ...
In this chapter, we explore ways to answer queries on large multi-dimensional data efficiently. Giv...
FastBit is a software tool for searching large read-only datasets. It organizes user data in a colum...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF...
This work focuses on research and development activities that bridge a gap between fundamental data...
This work focuses on research and development activities that bridge a gap between fundamental data ...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the- a...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
Scientific experiments and large-scale simulations produce massive amounts of data. Many of these sc...
Abstract—Scientific experiments and simulations produce mountains of data in file formats, such as H...
The ability to extract information from collected data has always driven science. Today.s large comp...
FastQuery is a parallel indexing and querying system we developed for accelerating analysis and visu...
In this paper, we describe a strategy of using compressed bitmap indices to speed up queries on bot...
Abstract. FastBit is a software tool for searching large read-only datasets. It organizes user data ...
In this chapter, we explore ways to answer queries on large multi-dimensional data efficiently. Giv...
FastBit is a software tool for searching large read-only datasets. It organizes user data in a colum...