Large linked data repositories have been built by leveraging semi-structured data in Wikipedia (e.g., DBpedia) and by extracting information from natural language text (e.g., YAGO). However, the Web contains many other vast sources of linked data, such as structured HTML tables and spreadsheets. Often, the semantics of such tables are hidden, preventing one from extracting triples from them directly. This paper describes a probabilistic method that augments an existing knowledge base with facts from tabular data by leveraging a Web text corpus and natural language patterns associated with relations in the knowledge base. A preliminary evaluation shows high potential for this technique in augmenting linked data repositories.
The Semantic Web is an extension of the classical web. The data and schemas it...
Web Information Extraction (WIE) systems extract billions of unique facts, but integrating the asser...
Modern knowledge bases such as Yago [14], DeepDive [19], and Google’s Knowledge Vault [6] are constr...
Cross-domain knowledge bases such as DBpedia, YAGO, or the Google Knowledge Graph have gained increa...
Cross-domain knowledge bases such as YAGO, DBpedia, or the Google Knowledge Graph are being used as ...
The Web has evolved into a huge mine of knowledge carved in different forms, the predominant one sti...
We study how to extend a large knowledge base (Freebase) by reading relational information from a la...
The Web contains a large number of relational HTML tables, which cover a multitude of different, oft...
The Data Web has undergone a tremendous growth period. It currently consists of more than 3300 publi...
Building large-scale knowledge bases from a variety of data sources is a longstanding goal of AI res...
To extend the coverage of Knowledge Bases (KBs), it is useful to integrate factual infor...
HTML tables on web pages ("web tables") have been used successfully as a data source for several app...