International audienceTwenty years after the pioneering experiments performed by Internet Archive and few national libraries, web archiving has become a common activity of many scientific, cultural, and heritage institutions. They are using a set of tools, generally open source, to identify, harvest, store, index, make available to end users, and preserve internet content over the long term. Institutions seeking to preserve web archives are however facing major challenges: not only the huge amount of collected data, but also the lack of fully reliable metadata, which are crucial to understand the web archives and inform future preservation actions upon them. Web archives are generally stored in container formats, notably the ARC file format...
International audienceThe Internet has been covered by legal deposit legislation in France since 200...
The British Library’s web archive comprises several terabyte of harvested websites. Like other conte...
Institutions that perform web crawls in order to gather heritage collections have millions – or even...
In a time where websites are ever changing, what metadata standards and tools are best for ensuring ...
International audienceInstitutions that perform web crawls in order to gather heritage collections h...
International audienceInstitutions that perform web crawls in order to gather heritage collections h...
Manager, Digital legal deposit Institutions that perform web crawls in order to gather heritage coll...
Selection and metadata issues which surround the preservation of digital information are discussed, ...
The National Library of France is mandated by French law to collect and preserve the French Internet...
The National Library of France is mandated by French law to collect and preserve the French Internet...
Metadata about digital objects help users find, understand, use and reuse those objects. Longevity o...
This study is to develope the structures and the elements of the metadata for harvesting, management...
The OCLC Research Library Partnership Web Archiving Metadata Working Group was formed to recommend d...
This presentation addresses the use, re-use, access and dissemination of data related to web archive...
There exist many digital collections of cultural and historical resources, referred to as digital ar...
International audienceThe Internet has been covered by legal deposit legislation in France since 200...
The British Library’s web archive comprises several terabyte of harvested websites. Like other conte...
Institutions that perform web crawls in order to gather heritage collections have millions – or even...
In a time where websites are ever changing, what metadata standards and tools are best for ensuring ...
International audienceInstitutions that perform web crawls in order to gather heritage collections h...
International audienceInstitutions that perform web crawls in order to gather heritage collections h...
Manager, Digital legal deposit Institutions that perform web crawls in order to gather heritage coll...
Selection and metadata issues which surround the preservation of digital information are discussed, ...
The National Library of France is mandated by French law to collect and preserve the French Internet...
The National Library of France is mandated by French law to collect and preserve the French Internet...
Metadata about digital objects help users find, understand, use and reuse those objects. Longevity o...
This study is to develope the structures and the elements of the metadata for harvesting, management...
The OCLC Research Library Partnership Web Archiving Metadata Working Group was formed to recommend d...
This presentation addresses the use, re-use, access and dissemination of data related to web archive...
There exist many digital collections of cultural and historical resources, referred to as digital ar...
International audienceThe Internet has been covered by legal deposit legislation in France since 200...
The British Library’s web archive comprises several terabyte of harvested websites. Like other conte...
Institutions that perform web crawls in order to gather heritage collections have millions – or even...