MammoTab is a dataset designed to evaluate semantic table annotation approaches. It includes two types of annotation: cell/mentions to Knowledge Graph (KG) entity matching (CEA task) and; column to KG class matching (CTA task). It is composed of 980254 tables extracted from 21149260 Wikipedia pages and annotated through Wikidata v. 20220708. The dataset is compliant with the data format used in SemTab2019
Richly annotated documents are a part of many knowledge extraction efforts. Such efforts may include...
WikiWoods is an ongoing initiative to provide rich syntacto-semantic annotations for English Wikiped...
MaskedWiki is a large-scale dataset for coreference resolution. It contains 130M passages from Wikip...
tFood is a dataset for tabular data to knowledge graph matching. It is derived for the Food domain a...
This paper describes MantisTable, an open source Semantic Table Interpretation tool, which automatic...
Tabular data to Knowledge Graph matching is the process of assigning semantic tags from knowledge gr...
Tough Tables (2T) is a dataset designed to evaluate table annotation approaches on the CEA task. The...
In recent years, there has been an increasing interest in extracting and annotating tables on the We...
We introduce a framework for automated semantic document annotation that is composed of four process...
This article introduces TableMiner+, a Semantic Table Interpretation method that annotates Web table...
Data sets used for experimental evaluation in the related publication: Matching Web Tables with Kno...
The Semantic Field Book Annotator is a web application developed for domain experts to harvest struc...
LinkingPark is an open-sourced system for automatic semantic table interpretation. Given a table of ...
Data Sets from the ISWC 2022 Semantic Web Challenge on Tabular Data to Knowledge Graph Matching. Fo...
International audienceWeb tables constitute valuable sources of information for various applications...
Richly annotated documents are a part of many knowledge extraction efforts. Such efforts may include...
WikiWoods is an ongoing initiative to provide rich syntacto-semantic annotations for English Wikiped...
MaskedWiki is a large-scale dataset for coreference resolution. It contains 130M passages from Wikip...
tFood is a dataset for tabular data to knowledge graph matching. It is derived for the Food domain a...
This paper describes MantisTable, an open source Semantic Table Interpretation tool, which automatic...
Tabular data to Knowledge Graph matching is the process of assigning semantic tags from knowledge gr...
Tough Tables (2T) is a dataset designed to evaluate table annotation approaches on the CEA task. The...
In recent years, there has been an increasing interest in extracting and annotating tables on the We...
We introduce a framework for automated semantic document annotation that is composed of four process...
This article introduces TableMiner+, a Semantic Table Interpretation method that annotates Web table...
Data sets used for experimental evaluation in the related publication: Matching Web Tables with Kno...
The Semantic Field Book Annotator is a web application developed for domain experts to harvest struc...
LinkingPark is an open-sourced system for automatic semantic table interpretation. Given a table of ...
Data Sets from the ISWC 2022 Semantic Web Challenge on Tabular Data to Knowledge Graph Matching. Fo...
International audienceWeb tables constitute valuable sources of information for various applications...
Richly annotated documents are a part of many knowledge extraction efforts. Such efforts may include...
WikiWoods is an ongoing initiative to provide rich syntacto-semantic annotations for English Wikiped...
MaskedWiki is a large-scale dataset for coreference resolution. It contains 130M passages from Wikip...