Transfer learning for hate speech detection in social media

Authors
Lanqin Yuan
Tianyu Wang
Gabriela Ferraro
Hanna Suominen
Marian-Andrei Rizoiu
Affiliations
[1] University of Technology Sydney
[2] The Australian National University
[3] University of Turku (UTU)
Keywords
Hate speech; Transfer learning; Visualization; Twitter; Domain adaptation; Offensive speech
DOI
Not available
Abstract
Today, the internet is an integral part of our daily lives, enabling people to be more connected than ever before. However, this greater connectivity and access to information increase exposure to harmful content, such as cyber-bullying and cyber-hatred. Models based on machine learning and natural language processing offer a way to make online platforms safer by identifying hate speech in web text autonomously. However, the main difficulty is annotating a sufficiently large number of examples to train these models. This paper uses a transfer learning technique to leverage two independent datasets jointly and builds a single representation of hate speech. We build an interpretable two-dimensional visualization tool of the constructed hate speech representation, dubbed the Map of Hate, in which multiple datasets can be projected and comparatively analyzed. The hateful content is annotated differently across the two datasets (racist and sexist in one dataset, hateful and offensive in another). Nevertheless, the common representation successfully projects the harmless class of both datasets into the same space and can be used to uncover labeling errors (false positives). We also show that the joint representation boosts prediction performance when only a limited amount of supervision is available. These methods and insights hold the potential for safer social media and can reduce the need to expose human moderators and annotators to distressing online messaging.
Pages: 1081-1101
Page count: 20