Crowdsourced annotation is vital to both collecting labelled data to train and test automated content moderation systems and to support human-in-the-loop review of system decisions. However, annotation tasks such as judging hate speech are subjective and thus highly sensitive to biases stemming from annotator beliefs, characteristics and demographics. We conduct two crowdsourcing studies on Mechanical Turk to examine annotator bias in labelling sexist and misogynistic hate speech. Results from 109 annotators show that annotator political inclination, moral integrity, personality traits, and sexist attitudes significantly impact annotation accuracy and the tendency to tag content as hate speech. In addition, semi-structured interviews with n...
One of the major bottlenecks in the development of data-driven AI Systems is the cost of reliable hu...
Annotators are not fungible. Their demographics, life experiences, and backgrounds all contribute to...
Crowdsourcing is a common strategy for collecting the “gold standard ” labels required for many natu...
In NLP annotation, it is common to have multiple annotators label the text and then obtain the groun...
Reference texts such as encyclopedias and news articles can manifest biased language when objective ...
Since state-of-the-art approaches to offensive language detection rely on supervised learning, it is...
Social Networking Sites are home to different forms of hate, including "Misogynoir", which specifica...
Human computation is often subject to systematic biases. We consider the case of linguistic biases a...
Researchers in computer science have spent considerable time developing methods to increase the accu...
Labelling, or annotation, is the process by which we assign labels to an item with regards to a task...
Human annotations can help indexing digital resources as well as improving search and recommendation...
Thesis: S.M., Massachusetts Institute of Technology, School of Architecture and Planning, Program in...
With the prevalence of machine learning in natural language processing and other fields, an increasi...
In this paper we present a proposal to address the problem of the pricey and unreliable human annota...
This project developed and tested an alternative methodology for dataset creation informing AI hate ...
One of the major bottlenecks in the development of data-driven AI Systems is the cost of reliable hu...
Annotators are not fungible. Their demographics, life experiences, and backgrounds all contribute to...
Crowdsourcing is a common strategy for collecting the “gold standard ” labels required for many natu...
In NLP annotation, it is common to have multiple annotators label the text and then obtain the groun...
Reference texts such as encyclopedias and news articles can manifest biased language when objective ...
Since state-of-the-art approaches to offensive language detection rely on supervised learning, it is...
Social Networking Sites are home to different forms of hate, including "Misogynoir", which specifica...
Human computation is often subject to systematic biases. We consider the case of linguistic biases a...
Researchers in computer science have spent considerable time developing methods to increase the accu...
Labelling, or annotation, is the process by which we assign labels to an item with regards to a task...
Human annotations can help indexing digital resources as well as improving search and recommendation...
Thesis: S.M., Massachusetts Institute of Technology, School of Architecture and Planning, Program in...
With the prevalence of machine learning in natural language processing and other fields, an increasi...
In this paper we present a proposal to address the problem of the pricey and unreliable human annota...
This project developed and tested an alternative methodology for dataset creation informing AI hate ...
One of the major bottlenecks in the development of data-driven AI Systems is the cost of reliable hu...
Annotators are not fungible. Their demographics, life experiences, and backgrounds all contribute to...
Crowdsourcing is a common strategy for collecting the “gold standard ” labels required for many natu...