Conventional approaches to information retrieval search through all applicable entries in an inverted file for a particular collection in order to find those documents with the highest scores. For particularly large collections this may be extremely time consuming. A solution to this problem is to only search a limited amount of the collection at query-time, in order to speed up the retrieval process. In doing this we can also limit the loss in retrieval efficacy (in terms of accuracy of results). The way we achieve this is to firstly identify the most “important” documents within the collection, and sort documents within inverted file lists in order of this “importance”. In this way we limit the amount of information to be searched at que...
When using Information Retrieval Systems (IRS), users often present search queries made of ad-hoc ke...
The Web search engines maintain large-scale inverted indexes which are queried thousands of times pe...
Text search engines return a set of k documents ranked by similarity to a query. Typically, document...
Conventional approaches to information retrieval search through all applicable entries in an inverte...
In this poster we describe alternative inverted index structures that reduce the time required to pr...
Similarity calculations and document ranking form the computationally expensive parts of query proce...
Ranking techniques have long been suggested as alternatives to more conventional Boolean methods for...
Our research shows that for large databases, without considerable additional storage overhead, clust...
An algorithm is described for ordering by probability of relevance overlapping document subsets from...
Information retrieval over clustered document collections has two successive stages: first identifyi...
Information retrieval is the process of recalling and ordering all relevant documents based on a use...
For the 2006 Terabyte track in TREC, Dublin City University’s participation was focussed on the ad h...
The task of an information retrieval system is to identify documents that will satisfy a user’s info...
Magíster en Ciencias, Mención ComputaciónWeb search has become an important part of day-to-day life....
This report aims to asses the efficiency of various inverted indexes when the indexed document colle...
When using Information Retrieval Systems (IRS), users often present search queries made of ad-hoc ke...
The Web search engines maintain large-scale inverted indexes which are queried thousands of times pe...
Text search engines return a set of k documents ranked by similarity to a query. Typically, document...
Conventional approaches to information retrieval search through all applicable entries in an inverte...
In this poster we describe alternative inverted index structures that reduce the time required to pr...
Similarity calculations and document ranking form the computationally expensive parts of query proce...
Ranking techniques have long been suggested as alternatives to more conventional Boolean methods for...
Our research shows that for large databases, without considerable additional storage overhead, clust...
An algorithm is described for ordering by probability of relevance overlapping document subsets from...
Information retrieval over clustered document collections has two successive stages: first identifyi...
Information retrieval is the process of recalling and ordering all relevant documents based on a use...
For the 2006 Terabyte track in TREC, Dublin City University’s participation was focussed on the ad h...
The task of an information retrieval system is to identify documents that will satisfy a user’s info...
Magíster en Ciencias, Mención ComputaciónWeb search has become an important part of day-to-day life....
This report aims to asses the efficiency of various inverted indexes when the indexed document colle...
When using Information Retrieval Systems (IRS), users often present search queries made of ad-hoc ke...
The Web search engines maintain large-scale inverted indexes which are queried thousands of times pe...
Text search engines return a set of k documents ranked by similarity to a query. Typically, document...