A common class of existing information retrieval systems provides access to abstracts. For example, Stanford University, through its FOLIO system, provides access to the INSPEC database of abstracts of the literature on physics, computer science, electrical engineering, etc. This paper studies that database using a trace-driven simulation. We focus on physical index design, inverted index caching, and database scaling in a distributed shared-nothing system. All three issues are shown to have a strong effect on response time and throughput. Database scaling is explored in two ways. One way assumes an "optimal" configuration for a single host and then linearly scales the database by duplicating the host architecture as needed...
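To make the idea of trace-driven evaluation of inverted index caching concrete, here is a minimal sketch. It is not the simulator used in the paper: the abstract does not describe its internals, so the LRU replacement policy, the function name `simulate_lru_cache`, and the abstract "size units" are all assumptions chosen for illustration. The sketch replays a recorded query trace against a fixed-capacity cache of inverted lists and counts hits, misses, and the volume read from disk.

```python
from collections import OrderedDict


def simulate_lru_cache(trace, list_sizes, capacity):
    """Replay a query trace against an LRU cache of inverted lists.

    trace      -- iterable of queries, each a list of terms (hypothetical format)
    list_sizes -- dict mapping term -> inverted-list size, in arbitrary units
    capacity   -- cache capacity in the same units
    Returns (hits, misses, units_read_from_disk).
    """
    cache = OrderedDict()  # term -> size, least recently used first
    used = 0
    hits = misses = disk_units = 0

    for query in trace:
        for term in query:
            size = list_sizes.get(term, 0)
            if term in cache:
                hits += 1
                cache.move_to_end(term)  # mark as most recently used
                continue

            # Miss: the whole inverted list is read from disk.
            misses += 1
            disk_units += size
            if size > capacity:
                continue  # list is too large to cache at all

            # Evict least recently used lists until the new one fits.
            while used + size > capacity:
                _, evicted_size = cache.popitem(last=False)
                used -= evicted_size
            cache[term] = size
            used += size

    return hits, misses, disk_units


if __name__ == "__main__":
    trace = [["a", "b"], ["a"], ["c"], ["b"]]
    sizes = {"a": 2, "b": 2, "c": 2}
    print(simulate_lru_cache(trace, sizes, capacity=4))
```

Running the same trace with different `capacity` values gives a rough picture of how cache size trades off against disk traffic. A real study would also need to model disk, CPU, and network service times to estimate response time and throughput.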
Simulation and analysis have shown that selective search can reduce the cost of large-scale distribu...
Today independent publishers are offering digital libraries with fulltext archives. In an attempt to...
For peer-to-peer web search engines it is important to quickly process queries and return search res...
Many information retrieval systems provide access to abstracts. For example Stanford University, th...
As information explodes across the Internet and intranets, information retrieval (IR) systems must c...
The major emphasis of this paper is on analytical techniques for predicting the performance of vario...
Large document collections are increasingly available over the network. In order for users to access...
The proliferation of the world's "information highways" has renewed interest in efficie...
Information retrieval systems often have to deal with very large amounts of data. They must be able ...
Large-scale web and text retrieval systems deal with amounts of data that greatly exceed the capacit...
The amount of information available over the Internet is increasing daily as well as the importance ...
To address the rapid growth of the Internet, modern Web search engines have to adopt distri...
In a shared-nothing, distributed text retrieval system, queries are processed over an inverted index...
The potential to provide interactive data manipulation across high-speed nationwide networks is stim...