Applications that query into very large multidimensional datasets are becoming more common. Many self-describing scientific data file formats have also emerged, which have structural metadata to help navigate the multi-dimensional arrays that are stored in the files. The files may also contain application-specific semantic metadata. In this paper, we discuss efficient methods for performing searches for subsets of multi-dimensional data objects, using semantic information to build multidimensional indexes, and group data items into properly sized chunks to maximize disk I/O bandwidth. This work is the first step in the design and implementation of a generic indexing library that will work with various high-dimension scientific data file for...
We present a new dynamic index structure for multidimensional data. The considered index structure i...
Across domains massive amounts of scientific data are generated which are useful beyond their origin...
The ability to extract information from collected data has always driven science. Today.s large comp...
Applications that query into very large multidimensional datasets are becoming more common. Many sel...
Applications that query into very large multidimensional datasets are becoming more common. Many sel...
While file system metadata is well characterized by a variety of workload studies, scientific metada...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
International audienceWhile high-dimensional search-by-similarity techniques reached their maturity ...
Scientific datasets are often stored on distributed archival storage systems, because geographically...
Large archives and digital sky surveys with dimensions of bytes currently exist, while in the near...
Scientific applications that query into very large multidimensional datasets are becoming more commo...
Scientific applications that query into very large multidimensional datasets are becoming more commo...
Scientific data analysis applications require large scale computing power to effectively service cli...
We present a new dynamic index structure for multidimensional data. The considered index structure i...
Across domains massive amounts of scientific data are generated which are useful beyond their origin...
The ability to extract information from collected data has always driven science. Today.s large comp...
Applications that query into very large multidimensional datasets are becoming more common. Many sel...
Applications that query into very large multidimensional datasets are becoming more common. Many sel...
While file system metadata is well characterized by a variety of workload studies, scientific metada...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF...
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
Large scale scientific data is often stored in scientific data formats such as FITS, netCDF and HDF....
International audienceWhile high-dimensional search-by-similarity techniques reached their maturity ...
Scientific datasets are often stored on distributed archival storage systems, because geographically...
Large archives and digital sky surveys with dimensions of bytes currently exist, while in the near...
Scientific applications that query into very large multidimensional datasets are becoming more commo...
Scientific applications that query into very large multidimensional datasets are becoming more commo...
Scientific data analysis applications require large scale computing power to effectively service cli...
We present a new dynamic index structure for multidimensional data. The considered index structure i...
Across domains massive amounts of scientific data are generated which are useful beyond their origin...
The ability to extract information from collected data has always driven science. Today.s large comp...