The effect of gender bias on hate speech detection

被引:0
|
作者
Sahinuc, Furkan [1 ]
Yilmaz, Eyup Halit [1 ]
Toraman, Cagri [1 ]
Koc, Aykut [2 ,3 ]
机构
[1] Aselsan Res Ctr, TR-06200 Ankara, Turkey
[2] Bilkent Univ, Dept Elect & Elect Engn, TR-06800 Ankara, Turkey
[3] Bilkent Univ, Natl Magnet Resonance Res Ctr, TR-06800 Ankara, Turkey
关键词
Debiased embedding; Deep learning; Gender identity; Hate speech; Language model;
D O I
10.1007/s11760-022-02368-z
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Hate speech against individuals or communities with different backgrounds is a major problem in online social networks. The domain of hate speech has spread to various topics, including race, religion, and gender. Although there are many efforts for hate speech detection in different domains and languages, the effects of gender identity are not solely examined in hate speech detection. Moreover, hate speech detection is mostly studied for particular languages, specifically English, but not low-resource languages, such as Turkish. We examine gender identity-based hate speech detection for both English and Turkish tweets. We compare the performances of state-of-the-art models using 20 k tweets per language. We observe that transformer-based language models outperform bag-of-words and deep learning models, while the conventional bag-of-words model has surprising performances, possibly due to offensive or hate-related keywords. Furthermore, we analyze the effect of debiased embeddings for hate speech detection. We find that the performance can be improved by removing the gender-related bias in neural embeddings since gender-biased words can have offensive or hateful implications.
引用
收藏
页码:1591 / 1597
页数:7
相关论文
共 50 条
  • [21] Hate speech detection with ADHAR: a multi-dialectal hate speech corpus in Arabic
    Charfi, Anis
    Besghaier, Mabrouka
    Akasheh, Raghda
    Atalla, Andria
    Zaghouani, Wajdi
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [22] Detection of Hate and Offensive Speech in Text
    Wani, Abid Hussain
    Molvi, Nahida Shafi
    Ashraf, Sheikh Ishrah
    [J]. INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2019), 2020, 11886 : 87 - 93
  • [23] THE ANTI GENDER HATE SPEECH ON SOCIAL NETWORKS AS A FORM OF VIOLENCE AGAINST WOMEN AND A FORM OF HATE SPEECH
    Igareda Gonzalez, Noelia
    [J]. DERECHOS Y LIBERTADES, 2022, (47) : 97 - 122
  • [24] Language Agnostic Hate Speech Detection
    Arango, Ayme
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 2475 - 2475
  • [25] Automated Hate Speech Detection on Twitter
    Koushik, Garima
    Rajeswari, K.
    Muthusamy, Suresh Kannan
    [J]. 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2019,
  • [26] Stereotypical Bias Removal for Hate Speech Detection Task using Knowledge-based Generalizations
    Badjatiya, Pinkesh
    Gupta, Manish
    Varma, Vasudeva
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 49 - 59
  • [27] Hate Speech Detection with Comment Embeddings
    Djuric, Nemanja
    Zhou, Jing
    Morris, Robin
    Grbovic, Mihajlo
    Radosavljevic, Vladan
    Bhamidipati, Narayan
    [J]. WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, : 29 - 30
  • [28] Hate speech detection: Challenges and solutions
    MacAvaney, Sean
    Yao, Hao-Ren
    Yang, Eugene
    Russell, Katina
    Goharian, Nazli
    Frieder, Ophir
    [J]. PLOS ONE, 2019, 14 (08):
  • [29] Constructing ensembles for hate speech detection
    Kucukkaya, Izzet Emre
    Toraman, Cagri
    [J]. NATURAL LANGUAGE PROCESSING, 2024,
  • [30] Topic Oriented Hate Speech Detection
    Jamil, Raihan
    Khan, Mohammad Abdullah Al Nayeem
    Anwar, Md Musfique
    [J]. HYBRID INTELLIGENT SYSTEMS, HIS 2021, 2022, 420 : 365 - 375