As scientific instruments and computer simulations produce more and more data, the task of locating the essential information to gain insight becomes increasingly difficult. FastBit is an efficient software tool to address this challenge. In this article, we present a summary of the key underlying technologies, namely bitmap compression, encoding, and binning. Together these techniques enable FastBit to answer structured (SQL) queries orders of magnitude faster than popular database systems. To illustrate how FastBit is used in applications, we present three examples involving a high-energy physics experiment, a combustion simulation, and an accelerator simulation. In each case, FastBit significantly reduces the response time and enables in...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the- a...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
As scientific instruments and computer simulations produce more and more data, the task of locating ...
FastBit is a software tool for searching large read-only data sets. It organizes user data in a col...
FastBit is a software tool for searching large read-only datasets. It organizes user data in a colum...
FastBit is a software package designed to meet the searching and filtering needs of data intensive ...
Abstract. FastBit is a software tool for searching large read-only datasets. It organizes user data ...
The database and the information retrieval communities have been working on separate sets of techniq...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
In this paper, we describe a strategy of using compressed bitmap indices to speed up queries on bot...
An index in a database system is a data structure that utilizes redundant information about the base...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
Most physics analysis jobs involve multiple selection steps on the input data. These selection steps...
FastBit is an efficient, compressed bitmap indexing technology that was developed in our group. In ...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the- a...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
As scientific instruments and computer simulations produce more and more data, the task of locating ...
FastBit is a software tool for searching large read-only data sets. It organizes user data in a col...
FastBit is a software tool for searching large read-only datasets. It organizes user data in a colum...
FastBit is a software package designed to meet the searching and filtering needs of data intensive ...
Abstract. FastBit is a software tool for searching large read-only datasets. It organizes user data ...
The database and the information retrieval communities have been working on separate sets of techniq...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
In this paper, we describe a strategy of using compressed bitmap indices to speed up queries on bot...
An index in a database system is a data structure that utilizes redundant information about the base...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
Most physics analysis jobs involve multiple selection steps on the input data. These selection steps...
FastBit is an efficient, compressed bitmap indexing technology that was developed in our group. In ...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the- a...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....