New application domains cause todays database sizes to grow rapidly, posing great demands on technology. Data fragmentation facilitates techniques (like distribution, parallelization, and main-memory computing) meeting these demands. Also, fragmentation might help improving effcient processing of query types such as top N. Database design and query optimization require a good notion of the costs resulting from a certain fragmentation. Our mathematically derived selectivity model facilitates this. Once its two parameters have been computed based on the fragmentation, after each (though usually infrequent) update, our model can forget the data distribution, resulting in fast and quite good selectivity estimation. We show experimental verifica...
Abstract The proliferation of online information resources increases the importance of effective and...
Simulation and analysis have shown that selective search can reduce the cost of large-scale distribu...
Abstract. The distributed data processing is an effective way to improve reliability, avail-ability ...
New application domains cause today's database sizes to grow rapidly, posing great demands on techno...
In the estimation of selectivity, many models assume that data is uniformly distributed, which is no...
Data abstraction and query processing techniques are usually studied in the domain of administrative...
Data abstraction and query processing techniques are usually studied in the domain of administrative...
1 Introduction Information retrieval (IR) deals with the problem of retrieving documents relevant to...
The balance between privacy and utility is a classical problem with an increasing impact on the desi...
Distributed processing is an operative way to improve the performance of the distributed database sy...
Server selection is typically defined as maximizing network performance under the assumption that ea...
Information retrieval is becoming increasingly concerned with resource selection and data fusion for...
Distributed database technology is expected to have a significant impact on data processing in the u...
Simulation and analysis have shown that selective search can reduce the cost of large-scale distribu...
In this paper, two-phase horizontal partitioning of distributed databases is addressed. First, prima...
Abstract The proliferation of online information resources increases the importance of effective and...
Simulation and analysis have shown that selective search can reduce the cost of large-scale distribu...
Abstract. The distributed data processing is an effective way to improve reliability, avail-ability ...
New application domains cause today's database sizes to grow rapidly, posing great demands on techno...
In the estimation of selectivity, many models assume that data is uniformly distributed, which is no...
Data abstraction and query processing techniques are usually studied in the domain of administrative...
Data abstraction and query processing techniques are usually studied in the domain of administrative...
1 Introduction Information retrieval (IR) deals with the problem of retrieving documents relevant to...
The balance between privacy and utility is a classical problem with an increasing impact on the desi...
Distributed processing is an operative way to improve the performance of the distributed database sy...
Server selection is typically defined as maximizing network performance under the assumption that ea...
Information retrieval is becoming increasingly concerned with resource selection and data fusion for...
Distributed database technology is expected to have a significant impact on data processing in the u...
Simulation and analysis have shown that selective search can reduce the cost of large-scale distribu...
In this paper, two-phase horizontal partitioning of distributed databases is addressed. First, prima...
Abstract The proliferation of online information resources increases the importance of effective and...
Simulation and analysis have shown that selective search can reduce the cost of large-scale distribu...
Abstract. The distributed data processing is an effective way to improve reliability, avail-ability ...