Recently, many Natural Language Processing (NLP) applications have improved the quality of their output by using machine learning techniques to mine Information Extraction (IE) patterns that capture information from the input text. Currently, mining IE patterns requires knowing in advance the type of information these patterns should capture. In this work, we propose a novel methodology for corpus analysis based on cross-examination of several document collections representing different instances of the same domain. We show that this methodology can be used for automatic domain template creation. As the problem of automatic domain template creation is rather new, there is no well-defined procedure for the evaluation ...
Many web sites contain large sets of pages generated using a common template or layout. For example...
Nowadays, unstructured and/or semi-structured machine-readable document automatically play...
One of the greatest challenges for search engines and other search tools, which are developed to cop...
We address the integration of information extraction (IE) and ontologies. In particular, using an ont...
In this paper, we are concerned with the problem of automatic template creation for Inform...
This disclosure aims to allow the user the possibility to extract the template of a document using o...
The aim of this paper is to describe a prototype state-of-the-art Information Extraction (IE) system...
Information Extraction (IE) can be defined as the task of automatically extracting prespecified kin...
ter Horst H, Hartung M, Cimiano P, Brazda N, Mueller HW, Klinger R. Learning soft domain constraints...
This paper presents an overview of automatic methods for building domain knowledge structures (domai...
Most of the text mining algorithms in use today are based on lexical representation of input texts, ...
I propose an architecture for a Natural Language Generation system that automatically learns sentenc...