Cognitive computing systems require human-labeled data for evaluation and often for training. The standard practice used in gathering this data minimizes disagreement between annotators, and we have found this results in data that fails to account for the ambiguity inherent in language. We have proposed the CrowdTruth method for collecting ground truth through crowdsourcing, which reconsiders the role of people in machine learning based on the observation that disagreement between annotators provides a useful signal for phenomena such as ambiguity in the text. We report on using this method to build an annotated data set for medical relation extraction for the cause and treat relations and how this data performed in a supervised training ex...
Ambiguity, complexity, and diversity in natural language textual expressions are major hindrances to...
Typically crowdsourcing-based approaches to gather annotated data use inter-annotator agreement as a...
Imagine we show an image to a person and ask her/him to decide whether the scene in the image is war...
<p>The lack of annotated datasets for training and benchmarking is one of the main challenges of Cli...
<p>The lack of annotated datasets for training and benchmarking is one of the main challenges of Cli...
A widespread use of linked data for information extraction is distant supervision, in which relation...
<p>One of the rst steps in most web data analytics is creating a human annotated ground truth, typic...
Abstract. In this paper, we introduce the CrowdTruth open-source soft-ware framework for machine-hum...
Abstract. This paper proposes an approach to gathering semantic an-notation, which rejects the notio...
Gathering training and evaluation data for open domain tasks, such as general question answering, is...
This repository contains a ground truth corpus for open domain relation extraction from sentences, a...
The process of gathering ground truth data through human annotation is a major bottleneck in the use...
In this tutorial, we introduce a novel crowdsourcing methodology called CrowdTruth [1, 9]. The centr...
Automatic information extraction (IE) enables the construction of very large knowledge bases (KBs), ...
Automatic information extraction (IE) enables the construction of very large knowledge bases (KBs), ...
Ambiguity, complexity, and diversity in natural language textual expressions are major hindrances to...
Typically crowdsourcing-based approaches to gather annotated data use inter-annotator agreement as a...
Imagine we show an image to a person and ask her/him to decide whether the scene in the image is war...
<p>The lack of annotated datasets for training and benchmarking is one of the main challenges of Cli...
<p>The lack of annotated datasets for training and benchmarking is one of the main challenges of Cli...
A widespread use of linked data for information extraction is distant supervision, in which relation...
<p>One of the rst steps in most web data analytics is creating a human annotated ground truth, typic...
Abstract. In this paper, we introduce the CrowdTruth open-source soft-ware framework for machine-hum...
Abstract. This paper proposes an approach to gathering semantic an-notation, which rejects the notio...
Gathering training and evaluation data for open domain tasks, such as general question answering, is...
This repository contains a ground truth corpus for open domain relation extraction from sentences, a...
The process of gathering ground truth data through human annotation is a major bottleneck in the use...
In this tutorial, we introduce a novel crowdsourcing methodology called CrowdTruth [1, 9]. The centr...
Automatic information extraction (IE) enables the construction of very large knowledge bases (KBs), ...
Automatic information extraction (IE) enables the construction of very large knowledge bases (KBs), ...
Ambiguity, complexity, and diversity in natural language textual expressions are major hindrances to...
Typically crowdsourcing-based approaches to gather annotated data use inter-annotator agreement as a...
Imagine we show an image to a person and ask her/him to decide whether the scene in the image is war...