As the Web of Linked Open Data is growing the problem of crawling that cloud becomes increasingly important. Unlike normal Web crawlers, a Linked Data crawler performs a selection to focus on collecting linked RDF (including RDFa) data on the Web. From the perspectives of throughput and coverage, given a newly discovered and targeted URI, the key issue of Linked Data crawlers is to decide whether this URI is likely to dereference into an RDF data source and therefore it is worth downloading the representation it points to. Current solutions adopt heuristic rules to filter irrelevant URIs. Unfortunately, when the heuristics are too restrictive this hampers the coverage of crawling. In this paper, we propose and compare approaches to learn st...
This work addresses issues related to the design and implementation of focused crawlers. Several var...
Focused crawlers aim to automatically discover online content resources relevant to a domain of inte...
Association rule mining has been widely studied in the context of basket analysis and sale recommend...
International audienceAs the Web of Linked Open Data is growing the problem of crawling that cloud b...
A Linked Data crawler performs a selection to focus on collecting linked RDF (including RDFa) data o...
International audienceA Linked Data crawler performs a selection to focus on collecting linked RDF (...
Many Linked Open Data applications require fresh copies of RDF data at their local repositories. Sin...
The enormous growth of the World Wide Web in recent years has made it necessary to perform resource ...
In recent years, the World Wide Web has shown enormous growth in size. Vast repositories of informat...
There has been a recent, tangible growth in RDF published on the Web in accordance with the Linked D...
In the world of Linked Data, HTTP URIs are names. A URI is dereferenced to obtain a copy or descript...
Linked Open Data (LOD) is the publicly available RDF data in the Web. Each LOD entity is identfied ...
Web crawling refers to the process of gathering data from the Web. Focused crawlers are programs tha...
This work addresses issues related to the design and implementation of focused crawlers. Several var...
Focused crawlers aim to automatically discover online content resources relevant to a domain of inte...
Association rule mining has been widely studied in the context of basket analysis and sale recommend...
International audienceAs the Web of Linked Open Data is growing the problem of crawling that cloud b...
A Linked Data crawler performs a selection to focus on collecting linked RDF (including RDFa) data o...
International audienceA Linked Data crawler performs a selection to focus on collecting linked RDF (...
Many Linked Open Data applications require fresh copies of RDF data at their local repositories. Sin...
The enormous growth of the World Wide Web in recent years has made it necessary to perform resource ...
In recent years, the World Wide Web has shown enormous growth in size. Vast repositories of informat...
There has been a recent, tangible growth in RDF published on the Web in accordance with the Linked D...
In the world of Linked Data, HTTP URIs are names. A URI is dereferenced to obtain a copy or descript...
Linked Open Data (LOD) is the publicly available RDF data in the Web. Each LOD entity is identfied ...
Web crawling refers to the process of gathering data from the Web. Focused crawlers are programs tha...
This work addresses issues related to the design and implementation of focused crawlers. Several var...
Focused crawlers aim to automatically discover online content resources relevant to a domain of inte...
Association rule mining has been widely studied in the context of basket analysis and sale recommend...