Applications that query very large multidimensional datasets are becoming more common. Many self-describing scientific data file formats have also emerged; these carry structural metadata that helps navigate the multidimensional arrays stored in the files. The files may also contain application-specific semantic metadata. In this paper, we discuss efficient methods for searching for subsets of multidimensional data objects: using semantic information to build multidimensional indexes, and grouping data items into properly sized chunks to maximize disk I/O bandwidth. This work is the first step in the design and implementation of a generic indexing library that will work with various high-dimensional scientific data file for...
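As a rough illustration of why chunking helps subset searches, the sketch below maps a rectangular range query onto a regular chunk grid and lists only the chunks that would need to be read. It is not the paper's method or library: the function name `chunks_for_query`, the regular-grid layout, the half-open query box, and the example array and chunk shapes are all assumptions made for this example.

```python
from itertools import product
from math import ceil


def chunks_for_query(shape, chunk_shape, lo, hi):
    """Return grid coordinates of every chunk overlapping the half-open box [lo, hi).

    Hypothetical helper: assumes the array is split into a regular grid of
    chunks of size chunk_shape, with chunk (0, 0, ...) at the array origin.
    """
    ranges = []
    for n, c, a, b in zip(shape, chunk_shape, lo, hi):
        a, b = max(a, 0), min(b, n)  # clip the query to the array bounds
        if a >= b:
            return []  # query is empty along this axis, so nothing to read
        # First and last chunk index touched along this axis.
        ranges.append(range(a // c, (b - 1) // c + 1))
    return list(product(*ranges))


# Assumed example: a 3-D variable (time, lat, lon) stored in 10 x 90 x 90 chunks.
shape = (100, 180, 360)
chunk_shape = (10, 90, 90)
hits = chunks_for_query(shape, chunk_shape, lo=(5, 80, 0), hi=(15, 100, 90))

total = 1
for n, c in zip(shape, chunk_shape):
    total *= ceil(n / c)

print(len(hits), "of", total, "chunks touched")
```

Under these assumptions the query touches only a small fraction of the chunk grid. Chunk shape is the tuning knob: larger chunks mean fewer, larger reads, but more unneeded data is read per query.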
Scientific applications that query into very large multidimensional datasets are becoming more commo...
Across domains massive amounts of scientific data are generated which are useful beyond their origin...
Scientific experiments and large-scale simulations produce massive amounts of data. Many of these sc...
Applications that query into very large multi-dimensional datasets are becoming more common. Many ...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF...
While file system metadata is well characterized by a variety of workload studies, scientific metada...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Scientific datasets are often stored on distributed archival storage systems, because geographically...
While high-dimensional search-by-similarity techniques reached their maturity ...
We present a new dynamic index structure for multidimensional data. The considered index structure i...
Large archives and digital sky surveys with dimensions of bytes currently exist, while in the near...