Web archives have contained the cultural history of the web for many years, but they still have a limited capability for access. Most of the web archiving research has focused on crawling and preservation activities, with little focus on the delivery methods. The current access methods are tightly coupled with web archive infrastructure, hard to replicate or integrate with other web archives, and do not cover all the users' needs. In this dissertation, we focus on the access methods for archived web data to enable users, third-party developers, researchers, and others to gain knowledge from the web archives. We build ArcSys, a new service framework that extracts, preserves, and exposes APIs for the web archive corpus. The dissertation in...
Many organizations and institutions rely heavily on a web presence to disseminate information and to...
Web archiving is the process of collecting valuable content from the World Wide Web in an archival...
Current version: v1.2.0 We tend to think of a web archive as a site we go to when links are broken –...
The Archives Unleashed project aims to improve scholarly access to web archives through a multi-pron...
Currently, web archives are challenging for users to discover and use. Many archives and libraries a...
Unlocking web archives through metadata, seed lists and derived data Frédéric Clavert and Valérie ...
This presentation addresses the use, re-use, access and dissemination of data related to web archive...
Web archives preserve the live Web for posterity, but the content on the Web one cares about may not...
Collections are the tools that people use to make sense of an ever-increasing number of archived web...
With the proliferation of public web archives, it is becoming more important to better profile their...