Objectivity federated databases may contain many terabytes of data and span thousands of files. In such an environment, a user can easily pose a query whose iterator ranges over millions of objects and whose evaluation requires opening thousands of databases. This presentation describes several technologies developed for such settings: (1) a query estimator, which tells the user how many objects satisfy the query, and how many databases will be touched, before any of those files are opened; (2) an order-optimized iterator, which behaves like an ordinary iterator except that elements are returned in an order optimized for efficient access, presorted by the database (and container) in which they reside; (3) a parallel implementation of the ord...
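The estimator and the order-optimized iterator described in the abstract above can be illustrated with a minimal sketch. It does not use the Objectivity API: `ObjectRef`, `estimate`, `order_optimized`, and `count_opens` are hypothetical names introduced here, and the sketch assumes that the references matching a query are already available (for example, from an index) and that only one database file is open at a time.

```python
from collections import Counter
from typing import Iterable, Iterator, NamedTuple


class ObjectRef(NamedTuple):
    """Hypothetical reference: where an object lives, not its contents."""
    database: int
    container: int
    slot: int


def estimate(refs: Iterable[ObjectRef]) -> tuple[int, int]:
    """Return (matching objects, distinct databases) without opening any file."""
    per_db = Counter(ref.database for ref in refs)
    return sum(per_db.values()), len(per_db)


def order_optimized(refs: Iterable[ObjectRef]) -> Iterator[ObjectRef]:
    """Yield the same references, presorted by database and then container."""
    yield from sorted(refs, key=lambda r: (r.database, r.container))


def count_opens(refs: Iterable[ObjectRef]) -> int:
    """Count database opens, assuming only one database is open at a time."""
    opens, current = 0, None
    for ref in refs:
        if ref.database != current:
            opens += 1
            current = ref.database
    return opens


refs = [
    ObjectRef(2, 0, 1),
    ObjectRef(1, 1, 0),
    ObjectRef(2, 1, 5),
    ObjectRef(1, 0, 3),
    ObjectRef(2, 0, 0),
]
n_objects, n_databases = estimate(refs)
opens_unsorted = count_opens(refs)
opens_sorted = count_opens(order_optimized(refs))
```

In this toy input the unsorted order alternates between the two databases on every element, while the presorted order visits each database once. A real implementation would also have to weigh the cost of the sort itself against the saved file opens.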
Finding a good join order is crucial for query performance. In this paper, we introduce the Join Ord...
As more and more data is provided in RDF format, storing huge amounts of RDF data and efficiently pr...
Physical database design is important for query performance in a shared-nothing parallel database sy...
Large volumes of data produced and shared within scientific communities are analyzed by many researc...
Over the past decade, a number of data intensive scalable systems have been developed to process ext...
The dream of computing power as readily available as the electricity in a wall socket is coming clos...
The diversity and large volumes of data processed in the Natural Sciences today have led to a prolife...
Scientific experiments and simulations produce mountains of data in file formats, such as H...
Data analysis applications such as Kronos, a remote sensing application, and the Virtual Microscope,...
Processing and storage of a large amount of information is one of the difficult and interesting task...
In the era of big data, with a massive set of digital information of unpreceden...
A major area of concern with very large databases is that of query and access time. This area has pl...
Database management systems will continue to manage large data volumes. Thus, efficient algorithms f...
In the very large object database systems planned for some future particle physics experiments, typi...