Abstract. Textual patterns have been used effectively to extract information from large text collections. However they rely heavily on textual redundancy in the sense that facts have to be mentioned in a similar manner in order to be generalized to a textual pattern. Data sparseness thus becomes a problem when trying to extract information from hardly redundant sources like corporate intranets, encyclopedic works or scientific databases. We present results on applying a weakly supervised pattern induction algorithm to Wikipedia to extract instances of arbitrary relations. In particular, we apply different configurations of a basic algorithm for pattern induction on seven different datasets. We show that the lack of redundancy leads to the n...
This thesis focuses on the design of algorithms for the extraction of knowledge (in terms of entitie...
International audienceMost of Information Extraction (IE) systems are designed for extracting a rest...
International audienceMost of Information Extraction (IE) systems are designed for extracting a rest...
Blohm S, Cimiano P. Using the Web to Reduce Data Sparseness in Pattern-based Information Extraction....
In this paper we present solutions for the crucial task of extracting structured information from ma...
We present a minimally-supervised approach for learning part whole relations from texts. Unlike prev...
We present a minimally-supervised approach for learning part whole relations from texts. Unlike prev...
We present a minimally-supervised approach for learning part whole relations from texts. Unlike prev...
We present a minimally-supervised approach for learning part whole relations from texts. Unlike prev...
Extracting information from text is the task of obtaining structured, machine-processable facts from...
Extracting information from text is the task of obtaining structured, machine-processable facts from...
The World Wide Web is now undeniably the richest and most dense source of information; yet, its stru...
Extracting information from text is the task of obtaining structured, machine-processable facts from...
This paper proposes the automatic acquisition of binary relational patterns (i.e. portions of text e...
This paper proposes the automatic acquisition of binary relational patterns (i.e. portions of text e...
This thesis focuses on the design of algorithms for the extraction of knowledge (in terms of entitie...
International audienceMost of Information Extraction (IE) systems are designed for extracting a rest...
International audienceMost of Information Extraction (IE) systems are designed for extracting a rest...
Blohm S, Cimiano P. Using the Web to Reduce Data Sparseness in Pattern-based Information Extraction....
In this paper we present solutions for the crucial task of extracting structured information from ma...
We present a minimally-supervised approach for learning part whole relations from texts. Unlike prev...
We present a minimally-supervised approach for learning part whole relations from texts. Unlike prev...
We present a minimally-supervised approach for learning part whole relations from texts. Unlike prev...
We present a minimally-supervised approach for learning part whole relations from texts. Unlike prev...
Extracting information from text is the task of obtaining structured, machine-processable facts from...
Extracting information from text is the task of obtaining structured, machine-processable facts from...
The World Wide Web is now undeniably the richest and most dense source of information; yet, its stru...
Extracting information from text is the task of obtaining structured, machine-processable facts from...
This paper proposes the automatic acquisition of binary relational patterns (i.e. portions of text e...
This paper proposes the automatic acquisition of binary relational patterns (i.e. portions of text e...
This thesis focuses on the design of algorithms for the extraction of knowledge (in terms of entitie...
International audienceMost of Information Extraction (IE) systems are designed for extracting a rest...
International audienceMost of Information Extraction (IE) systems are designed for extracting a rest...