International audienceInstitutions that perform web crawls in order to gather heritage collections have millions - or even billions - of files encoded in thousands of different formats about which they barely know anything. Many of these heritage institutions are members of the International Internet Preservation Consortium, whose Preservation Working Group decided to address the issues related to format identification in web archive. Its first goal is to design an overview of the formats to be found in different types of collections (large-, small-scale...) over time. It shows that the web seems to be becoming a more standardized space. A small number of formats - frequently open - cover from 90 to 95% of web archive collections, and we ca...
Web archives created by the Internet Archive (IA) (https://archive.org), national libraries and othe...
Abstract. Digital libraries have been built all over the world. Libraries are engaged in creating an...
Digital preservation can encompass a range of activities, from simple replication and storage to mor...
International audienceInstitutions that perform web crawls in order to gather heritage collections h...
Institutions that perform web crawls in order to gather heritage collections have millions – or even...
Manager, Digital legal deposit Institutions that perform web crawls in order to gather heritage coll...
International audienceTwenty years after the pioneering experiments performed by Internet Archive an...
Web archives, a key area of digital preservation, meet the needs of journalists, social scientists, ...
The need for studying and promoting web-archiving for longterm information preservation and accessib...
Is software obsolescence a significant risk? To explore this issue, we analysed a corpus of over 2.5...
Is software obsolescence a significant risk? To explore this issue, we analysed a corpus of over 2.5...
Digital archives are not meant to be mere collections of digital artifacts organized for reference. ...
Many institutions are now building rich, significant archives of web content. Though the number of w...
Digital and institutional repositories are changing, and rapidly growing repositories targetting new...
This is an accepted manuscript of an article to be published in the Journal of the Association for I...
Web archives created by the Internet Archive (IA) (https://archive.org), national libraries and othe...
Abstract. Digital libraries have been built all over the world. Libraries are engaged in creating an...
Digital preservation can encompass a range of activities, from simple replication and storage to mor...
International audienceInstitutions that perform web crawls in order to gather heritage collections h...
Institutions that perform web crawls in order to gather heritage collections have millions – or even...
Manager, Digital legal deposit Institutions that perform web crawls in order to gather heritage coll...
International audienceTwenty years after the pioneering experiments performed by Internet Archive an...
Web archives, a key area of digital preservation, meet the needs of journalists, social scientists, ...
The need for studying and promoting web-archiving for longterm information preservation and accessib...
Is software obsolescence a significant risk? To explore this issue, we analysed a corpus of over 2.5...
Is software obsolescence a significant risk? To explore this issue, we analysed a corpus of over 2.5...
Digital archives are not meant to be mere collections of digital artifacts organized for reference. ...
Many institutions are now building rich, significant archives of web content. Though the number of w...
Digital and institutional repositories are changing, and rapidly growing repositories targetting new...
This is an accepted manuscript of an article to be published in the Journal of the Association for I...
Web archives created by the Internet Archive (IA) (https://archive.org), national libraries and othe...
Abstract. Digital libraries have been built all over the world. Libraries are engaged in creating an...
Digital preservation can encompass a range of activities, from simple replication and storage to mor...