Modern scientific datasets present numerous data management and analysis challenges. State-of-the- art index and query technologies such as FastBit can significantly improve accesses to these datasets by augmenting the user data with indexes and other secondary information. However, a challenge is that the indexes assume the relational data model but the scientific data generally follows the array data model. To match the two data models, we design a generic mapping mechanism and implement an efficient input and output interface for reading and writing the data and their corresponding indexes. To take advantage of the emerging many-core architectures, we also develop a parallel strategy for indexing using threading technology. This approach...
FastBit is a software tool for searching large read-only datasets. It organizes user data in a colum...
FastBit is a software tool for searching large read-only data sets. It organizes user data in a col...
The ability to extract information from collected data has always driven science. Today.s large comp...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the- a...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
FastQuery is a parallel indexing and querying system we developed for accelerating analysis and visu...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF...
This work focuses on research and development activities that bridge a gap between fundamental data...
Abstract—Scientific experiments and simulations produce mountains of data in file formats, such as H...
This work focuses on research and development activities that bridge a gap between fundamental data ...
Scientific experiments and large-scale simulations produce massive amounts of data. Many of these sc...
As computing power increases exponentially, vast amount of data is created by many scientific re- se...
FastBit is a software tool for searching large read-only datasets. It organizes user data in a colum...
FastBit is a software tool for searching large read-only data sets. It organizes user data in a col...
The ability to extract information from collected data has always driven science. Today.s large comp...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the- a...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
FastQuery is a parallel indexing and querying system we developed for accelerating analysis and visu...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF...
This work focuses on research and development activities that bridge a gap between fundamental data...
Abstract—Scientific experiments and simulations produce mountains of data in file formats, such as H...
This work focuses on research and development activities that bridge a gap between fundamental data ...
Scientific experiments and large-scale simulations produce massive amounts of data. Many of these sc...
As computing power increases exponentially, vast amount of data is created by many scientific re- se...
FastBit is a software tool for searching large read-only datasets. It organizes user data in a colum...
FastBit is a software tool for searching large read-only data sets. It organizes user data in a col...
The ability to extract information from collected data has always driven science. Today.s large comp...