Query processing and optimization in mediator systems that access distributed non-proprietary sources pose many novel problems. Cost-based query optimization is hard because the mediator does not have access to source statistics information and furthermore it may not be easy to model the source's performance. At the same time, querying remote sources may be very expensive because of high connection overhead, long computation time, financial charges, and temporary unavailability. We propose a costbased optimization technique that caches statistics of actual calls to the sources and consequently estimates the cost of the possible execution plans based on the statistics cache. We investigate issues pertaining to the design of the statisti...
MQO is a distributed multiple query processing middleware that can use resources available on the Gr...
Search engines and large scale IR systems need to cache query results for efficiency and scalability...
Caching is fundamental to performance in distributed information retrieval systems such as the World...
Search engines and large-scale IR systems need to cache query results for efficiency and scalability...
In data integration systems, queries posed to a mediator need to be translated into a sequence of qu...
In this paper, we study the performance of a distributed search engine from a data caching point of ...
International audienceLeSelect is a mediator system which allows scientists to publish their resourc...
LeSelect is a mediator system which allows scientists to publish their resources (data and programs)...
We study how to reduce costs in client-server applications with dynamic data on the server. Client-s...
Search engines and large scale IR systems need to cache query results for efficiency and scalability...
There is currently great interest in building information mediators that can integrate information f...
Data intensive applications today usually run in either a clientserver or a middleware environment. ...
Scientific database federations are geographically dis-tributed and network bound. Thus, they could ...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
MQO is a distributed multiple query processing middleware that can use resources available on the Gr...
MQO is a distributed multiple query processing middleware that can use resources available on the Gr...
Search engines and large scale IR systems need to cache query results for efficiency and scalability...
Caching is fundamental to performance in distributed information retrieval systems such as the World...
Search engines and large-scale IR systems need to cache query results for efficiency and scalability...
In data integration systems, queries posed to a mediator need to be translated into a sequence of qu...
In this paper, we study the performance of a distributed search engine from a data caching point of ...
International audienceLeSelect is a mediator system which allows scientists to publish their resourc...
LeSelect is a mediator system which allows scientists to publish their resources (data and programs)...
We study how to reduce costs in client-server applications with dynamic data on the server. Client-s...
Search engines and large scale IR systems need to cache query results for efficiency and scalability...
There is currently great interest in building information mediators that can integrate information f...
Data intensive applications today usually run in either a clientserver or a middleware environment. ...
Scientific database federations are geographically dis-tributed and network bound. Thus, they could ...
The multiple-query optimization (MQO) problem has been well-studied in the research literature, usu...
MQO is a distributed multiple query processing middleware that can use resources available on the Gr...
MQO is a distributed multiple query processing middleware that can use resources available on the Gr...
Search engines and large scale IR systems need to cache query results for efficiency and scalability...
Caching is fundamental to performance in distributed information retrieval systems such as the World...