Web archives, such as the Internet Archive, preserve the web and allow access to prior states of web pages. We implicitly trust their versions of archived pages, but as their role moves from preserving curios of the past to facilitating present day adjudication, we are concerned with verifying the fixity of archived web pages, or mementos, to ensure they have always remained unaltered. A widely used technique in digital preservation to verify the fixity of an archived resource is to periodically compute a cryptographic hash value on a resource and then compare it with a previous hash value. If the hash values generated on the same resource are identical, then the fixity of the resource is verified. We tested this process by conducting a stu...
Web archiving is serving the task of knowledge preservation for the ever changing state of the web. ...
While conducting a validation study of proficiency test media we found that applying the same hash a...
The current Web has no general mechanisms to make digital artifacts-such as datasets, code, texts, a...
This work investigates the fixity of a set of archived webpages, or mementos. We conducted a study o...
The number of public and private web archives has increased, and we implicitly trust content deliver...
To make digital resources on the web verifiable, immutable, and permanent, we propose a technique to...
The Internet Archive’s Wayback Machine is the largest modern web archive, preserving web content sin...
One of the most important aspects of the long-term digital-image preservation strategy is maintainin...
Web archives do not capture every resource on every page that they attempt to archive. This results ...
Web archives preserve the live Web for posterity, but the content on the Web one cares about may not...
[First paragraph] While analyzing mementos in a recent experiment, we discovered problems processing...
When replaying an archived web page (known as a memento), the fundamental expectation is that the pa...
An Internet web browser can archive webpages visited by a user. The archived pages can be verified b...
Quantifying the captures of a URI over time is useful for researchers to identify the extent to whic...
Certain HTTP Cookies on certain sites can be a source of content bias in archival crawls. Accommodat...
Web archiving is serving the task of knowledge preservation for the ever changing state of the web. ...
While conducting a validation study of proficiency test media we found that applying the same hash a...
The current Web has no general mechanisms to make digital artifacts-such as datasets, code, texts, a...
This work investigates the fixity of a set of archived webpages, or mementos. We conducted a study o...
The number of public and private web archives has increased, and we implicitly trust content deliver...
To make digital resources on the web verifiable, immutable, and permanent, we propose a technique to...
The Internet Archive’s Wayback Machine is the largest modern web archive, preserving web content sin...
One of the most important aspects of the long-term digital-image preservation strategy is maintainin...
Web archives do not capture every resource on every page that they attempt to archive. This results ...
Web archives preserve the live Web for posterity, but the content on the Web one cares about may not...
[First paragraph] While analyzing mementos in a recent experiment, we discovered problems processing...
When replaying an archived web page (known as a memento), the fundamental expectation is that the pa...
An Internet web browser can archive webpages visited by a user. The archived pages can be verified b...
Quantifying the captures of a URI over time is useful for researchers to identify the extent to whic...
Certain HTTP Cookies on certain sites can be a source of content bias in archival crawls. Accommodat...
Web archiving is serving the task of knowledge preservation for the ever changing state of the web. ...
While conducting a validation study of proficiency test media we found that applying the same hash a...
The current Web has no general mechanisms to make digital artifacts-such as datasets, code, texts, a...