Adaptive Information Extraction systems (IES) are currently used by some Semantic Web (SW) annotation tools as support to annotation (Handschuh et al., 2002; Vargas-Vera et al., 2002). They are generally based on fully supervised methodologies requiring fairly intense domain-specific annotation. Unfortunately, selecting representative examples may be difficult and annotations can be incorrect and require time. In this paper we present a methodology that drastically reduce (or even remove) the amount of manual annotation required when annotating consistent sets of pages. A very limited number of user-defined examples are used to bootstrap learning. Simple, high precision (and possibly high recall) IE patterns are induced using such examples,...
Blohm S, Cimiano P. Using the Web to Reduce Data Sparseness in Pattern-based Information Extraction....
The advent of the era of big data on the Web has made automatic web information extraction an essent...
Automatic intelligent web exploration will benefit from shallow information extraction techniques if...
In this paper we propose a methodology to learn to extract domain-specific information from large re...
In this paper we propose a methodology to learn to automatically annotate domain-specific informatio...
Journal ArticleMany information extraction (IE) systems rely on manually annotated training data to ...
This work was carried out within the AKT project (www.aktors.org), sponsored by the UK Engineering a...
The evolution of the Internet into the largest existent digital library is bringing about new challe...
The human effort in large-scale web data extraction significantly affects both the extraction flexib...
Abstract World Wide Web is transforming itself into the largest information re-source making the pro...
Customization to specific domains of dis-course and/or user requirements is one of the greatest chal...
Wrapper induction faces a dilemma: To reach web scale, it requires automatically generated examples,...
The traditional process of document annotation for knowledge identification and extraction in the Se...
The web of today has evolved into a huge repository of rich Multimedia content for human consumptio...
Wrapper induction faces a dilemma: To reach web scale, it requires automatically generated examples,...
Blohm S, Cimiano P. Using the Web to Reduce Data Sparseness in Pattern-based Information Extraction....
The advent of the era of big data on the Web has made automatic web information extraction an essent...
Automatic intelligent web exploration will benefit from shallow information extraction techniques if...
In this paper we propose a methodology to learn to extract domain-specific information from large re...
In this paper we propose a methodology to learn to automatically annotate domain-specific informatio...
Journal ArticleMany information extraction (IE) systems rely on manually annotated training data to ...
This work was carried out within the AKT project (www.aktors.org), sponsored by the UK Engineering a...
The evolution of the Internet into the largest existent digital library is bringing about new challe...
The human effort in large-scale web data extraction significantly affects both the extraction flexib...
Abstract World Wide Web is transforming itself into the largest information re-source making the pro...
Customization to specific domains of dis-course and/or user requirements is one of the greatest chal...
Wrapper induction faces a dilemma: To reach web scale, it requires automatically generated examples,...
The traditional process of document annotation for knowledge identification and extraction in the Se...
The web of today has evolved into a huge repository of rich Multimedia content for human consumptio...
Wrapper induction faces a dilemma: To reach web scale, it requires automatically generated examples,...
Blohm S, Cimiano P. Using the Web to Reduce Data Sparseness in Pattern-based Information Extraction....
The advent of the era of big data on the Web has made automatic web information extraction an essent...
Automatic intelligent web exploration will benefit from shallow information extraction techniques if...