The proliferation of hate speech is a growing challenge for social media platforms, as toxic online comments can have dangerous consequences also in real life. There is a need for tools that can automatically and reliably detect hateful comments, and deep learning models have proven effective in solving this issue. However, these models have been shown to have unintended bias against some categories of people. Specifically, they may classify comments that reference certain frequently attacked identities (such as gay, black, or Muslim) as toxic even if the comments themselves are actually not toxic (e.g. ”I am Muslim”). To address this bias, previous authors introduced an Entropy-based Attention Regularization (EAR) method which, when applie...
Hate speech is an important problem in the management of user-generated content. To remove offensive...
In this paper, we introduce HateBERT, a re-trained BERT model for abusive language detection in Engl...
Abstract This paper presents the results and main findings of the HASOC-2021 Hate/Offensive Languag...
The proliferation of hate speech is a growing challenge for social media platforms, as toxic online ...
Denne avhandlingen presenterer en ny løsning for deteksjon av hatefulle ytringer som kombinerer fler...
Denne avhandlingen presenterer en ny løsning for deteksjon av hatefulle ytringer som kombinerer fler...
Disparate biases associated with datasets and trained classifiers in hateful and abusive content ide...
Sosiale internettjenester har sett en økning i uønsket innhold slik som fornærmende eller hatefulle ...
Over the past two decades, online discussion has skyrocketed in scope and scale. However, so has the...
In this work we present a new Norwegian labeled dataset of 7078 comments for unhealthy comment detec...
Hate speech on online social media platforms is now at a level that has been considered a serious co...
As insulting statements become more frequent on online platforms, these negative statements create a...
Forskning på sikkerhet i sosiale medier har vokst betydelig det siste tiåret. Med den utbredte bruke...
International audienceGenerated hateful and toxic content by a portion of users in social media is a...
Disclaimer: This paper contains content that can be perceived as offensive or upsetting. Considerabl...
Hate speech is an important problem in the management of user-generated content. To remove offensive...
In this paper, we introduce HateBERT, a re-trained BERT model for abusive language detection in Engl...
Abstract This paper presents the results and main findings of the HASOC-2021 Hate/Offensive Languag...
The proliferation of hate speech is a growing challenge for social media platforms, as toxic online ...
Denne avhandlingen presenterer en ny løsning for deteksjon av hatefulle ytringer som kombinerer fler...
Denne avhandlingen presenterer en ny løsning for deteksjon av hatefulle ytringer som kombinerer fler...
Disparate biases associated with datasets and trained classifiers in hateful and abusive content ide...
Sosiale internettjenester har sett en økning i uønsket innhold slik som fornærmende eller hatefulle ...
Over the past two decades, online discussion has skyrocketed in scope and scale. However, so has the...
In this work we present a new Norwegian labeled dataset of 7078 comments for unhealthy comment detec...
Hate speech on online social media platforms is now at a level that has been considered a serious co...
As insulting statements become more frequent on online platforms, these negative statements create a...
Forskning på sikkerhet i sosiale medier har vokst betydelig det siste tiåret. Med den utbredte bruke...
International audienceGenerated hateful and toxic content by a portion of users in social media is a...
Disclaimer: This paper contains content that can be perceived as offensive or upsetting. Considerabl...
Hate speech is an important problem in the management of user-generated content. To remove offensive...
In this paper, we introduce HateBERT, a re-trained BERT model for abusive language detection in Engl...
Abstract This paper presents the results and main findings of the HASOC-2021 Hate/Offensive Languag...