Abstract. With the proliferation of public web archives, it is becom-ing more important to better profile their contents, both to understand their immense holdings as well as support routing of requests in the Me-mento aggregator. To save time, the Memento aggregator should only poll the archives that are likely to have a copy of the requested URI. Using the CDX files produced after crawling, we can generate profiles of the archives that summarize their holdings and can be used to in-form routing of the Memento aggregator’s URI requests. Previous work in profiling ranged from using full URIs (no false positives, but with large profiles) to using only top-level domains (TLDs) (smaller profiles, but with many false positives). This work explo...
Memento aggregators enable users to query multiple web archives for captures of a URI in time throug...
PDF of a powerpoint presentation from TPDL 2013: 17th International Conference on Theory and Practic...
Web archives are a window to view past versions of webpages. When a user requests a webpage on the l...
With the proliferation of public web archives, it is becoming more important to better profile their...
Abstract. The Memento aggregator currently polls every known pub-lic web archive when serving a requ...
We introduce MementoMap, a framework to express and disseminate holdings of web archives (archive pr...
PDF of a powerpoint presentation from the 2014 International Internet Preservation Consortium (IIPC)...
Web archives have contained the cultural history of the web for many years, but they still have a li...
Web archives preserve the history of Web sites and have high long-term value for media and busines...
Web archives have contained the cultural history of the web for many years, but they still have a li...
The Memento Project’s archive access additions to HTTP have enabled development of new web archive a...
International audienceDue to the growing importance of the Web, several archiving institutes (nation...
Web archives, a key area of digital preservation, meet the needs of journalists, social scientists, ...
Currently, web archives are challenging for users to discover and use. Many archives and libraries a...
ftp archive sites are well known throughout the Internet community for their wealth of useful inform...
Memento aggregators enable users to query multiple web archives for captures of a URI in time throug...
PDF of a powerpoint presentation from TPDL 2013: 17th International Conference on Theory and Practic...
Web archives are a window to view past versions of webpages. When a user requests a webpage on the l...
With the proliferation of public web archives, it is becoming more important to better profile their...
Abstract. The Memento aggregator currently polls every known pub-lic web archive when serving a requ...
We introduce MementoMap, a framework to express and disseminate holdings of web archives (archive pr...
PDF of a powerpoint presentation from the 2014 International Internet Preservation Consortium (IIPC)...
Web archives have contained the cultural history of the web for many years, but they still have a li...
Web archives preserve the history of Web sites and have high long-term value for media and busines...
Web archives have contained the cultural history of the web for many years, but they still have a li...
The Memento Project’s archive access additions to HTTP have enabled development of new web archive a...
International audienceDue to the growing importance of the Web, several archiving institutes (nation...
Web archives, a key area of digital preservation, meet the needs of journalists, social scientists, ...
Currently, web archives are challenging for users to discover and use. Many archives and libraries a...
ftp archive sites are well known throughout the Internet community for their wealth of useful inform...
Memento aggregators enable users to query multiple web archives for captures of a URI in time throug...
PDF of a powerpoint presentation from TPDL 2013: 17th International Conference on Theory and Practic...
Web archives are a window to view past versions of webpages. When a user requests a webpage on the l...