A Web archive usually contains multiple versions of documents crawled from the Web at different points in time. One possible way for users to access a Web archive is through full-text search systems. However, previous studies have shown that these systems can induce a bias, known as the retrievability bias, on the accessibility of documents in community-collected collections (such as TREC collections). This bias can be measured by analyzing the distribution of the retrievability scores for each document in a collection, quantifying the likelihood of a document’s retrieval. We investigate the suitability of retrievability scores in retrieval systems that consider every version of a document in a Web archive as an independent document. We sho...
Bias in the retrieval of documents can directly influence the information access of a digital librar...
Retrievability is an important and interesting indicator that can be used in a number of ways to ana...
Retrievability provides an alternative way to assess an Information Retrieval (IR) system by measuri...
htmlabstractA Web archive usually contains multiple versions of documents crawled from the Web at di...
A Web archive usually contains multiple versions of documents crawled from the Web at different poin...
A Web archive usually contains multiple versions of documents crawled from the Web at different poin...
A Web archive usually contains multiple versions of documents crawled from the Web at different poin...
A Web archive usually contains multiple versions of documents crawled from the Web at different poin...
Retrievability is the measure of how easily a document can be retrieved using a particular retrieval...
Retrievability is the measure of how easily a document can be retrieved using a particular retrieval...
Bias in the retrieval of documents can directly influence the information access of a digital librar...
Bias in the retrieval of documents can directly influence the information access of a digital librar...
Bias in the retrieval of documents can directly influence the information access of a digital librar...
Bias in the retrieval of documents can directly influence the information access of a digital librar...
Most information retrieval settings, such as web search, are typically precision-oriented, i.e. they...
Bias in the retrieval of documents can directly influence the information access of a digital librar...
Retrievability is an important and interesting indicator that can be used in a number of ways to ana...
Retrievability provides an alternative way to assess an Information Retrieval (IR) system by measuri...
htmlabstractA Web archive usually contains multiple versions of documents crawled from the Web at di...
A Web archive usually contains multiple versions of documents crawled from the Web at different poin...
A Web archive usually contains multiple versions of documents crawled from the Web at different poin...
A Web archive usually contains multiple versions of documents crawled from the Web at different poin...
A Web archive usually contains multiple versions of documents crawled from the Web at different poin...
Retrievability is the measure of how easily a document can be retrieved using a particular retrieval...
Retrievability is the measure of how easily a document can be retrieved using a particular retrieval...
Bias in the retrieval of documents can directly influence the information access of a digital librar...
Bias in the retrieval of documents can directly influence the information access of a digital librar...
Bias in the retrieval of documents can directly influence the information access of a digital librar...
Bias in the retrieval of documents can directly influence the information access of a digital librar...
Most information retrieval settings, such as web search, are typically precision-oriented, i.e. they...
Bias in the retrieval of documents can directly influence the information access of a digital librar...
Retrievability is an important and interesting indicator that can be used in a number of ways to ana...
Retrievability provides an alternative way to assess an Information Retrieval (IR) system by measuri...