Transfer learning for hate speech detection in social media

Authors
Lanqin Yuan
Tianyu Wang
Gabriela Ferraro
Hanna Suominen
Marian-Andrei Rizoiu
Affiliations
[1] University of Technology Sydney
[2] The Australian National University
[3] University of Turku (UTU)
Keywords
Hate speech; Transfer learning; Visualization; Twitter; Domain adaptation; Offensive speech
Abstract
Today, the internet is an integral part of our daily lives, enabling people to be more connected than ever before. However, this greater connectivity and access to information increase exposure to harmful content, such as cyber-bullying and cyber-hatred. Models based on machine learning and natural language processing offer a way to make online platforms safer by identifying hate speech in web text autonomously. The main difficulty is annotating a sufficiently large number of examples to train these models. This paper uses a transfer learning technique to leverage two independent datasets jointly and build a single representation of hate speech. We build an interpretable two-dimensional visualization tool of the constructed hate speech representation, dubbed the Map of Hate, in which multiple datasets can be projected and comparatively analyzed. The hateful content is annotated differently across the two datasets (racist and sexist in one dataset, hateful and offensive in another). Nevertheless, the common representation successfully projects the harmless class of both datasets into the same space and can be used to uncover labeling errors (false positives). We also show that the joint representation boosts prediction performance when only a limited amount of supervision is available. These methods and insights hold the potential for safer social media and reduce the need to expose human moderators and annotators to distressing online messaging.
Pages: 1081-1101
Page count: 20
Related papers
  • [1] Transfer learning for hate speech detection in social media
    Yuan, Lanqin
    Wang, Tianyu
    Ferraro, Gabriela
    Suominen, Hanna
    Rizoiu, Marian-Andrei
    [J]. JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2023, 6 (02) : 1081 - 1101
  • [2] Leveraging Transfer Learning for Hate Speech Detection in Portuguese Social Media Posts
    Ramos, Gil
    Batista, Fernando
    Ribeiro, Ricardo
    Fialho, Pedro
    Moro, Sergio
    Fonseca, Antonio
    Guerra, Rita
    Carvalho, Paula
    Marques, Catarina
    Silva, Claudia
    [J]. IEEE ACCESS, 2024, 12 : 101374 - 101389
  • [3] A BERT-Based Transfer Learning Approach for Hate Speech Detection in Online Social Media
    Mozafari, Marzieh
    Farahbakhsh, Reza
    Crespi, Noel
    [J]. COMPLEX NETWORKS AND THEIR APPLICATIONS VIII, VOL 1, 2020, 881 : 928 - 940
  • [4] A transfer learning approach for detecting offensive and hate speech on social media platforms
    Ishaani Priyadarshini
    Sandipan Sahu
    Raghvendra Kumar
    [J]. Multimedia Tools and Applications, 2023, 82 : 27473 - 27499
  • [5] A transfer learning approach for detecting offensive and hate speech on social media platforms
    Priyadarshini, Ishaani
    Sahu, Sandipan
    Kumar, Raghvendra
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (18) : 27473 - 27499
  • [6] Sinhala Hate Speech Detection in Social Media Using Machine Learning and Deep Learning
    Fernando, W. S. S.
    Weerasinghe, Ruvan
    Bandara, E. R. A. D.
    [J]. 2022 22ND INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER), 2022,
  • [7] A comparative analysis of machine learning algorithms for hate speech detection in social media
    Omran, Esraa
    Al Tararwah, Estabraq
    Al Qundus, Jamal
    [J]. ONLINE JOURNAL OF COMMUNICATION AND MEDIA TECHNOLOGIES, 2023, 13 (04):
  • [8] Advances in Machine Learning Algorithms for Hate Speech Detection in Social Media: A Review
    Mullah, Nanlir Sallau
    Zainon, Wan Mohd Nazmee Wan
    [J]. IEEE ACCESS, 2021, 9 : 88364 - 88376
  • [9] Lifelong Learning of Hate Speech Classification on Social Media
    Qian, Jing
    Wang, Hong
    ElSherief, Mai
    Yan, Xifeng
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021 : 2304 - 2314
  • [10] Multimodal Hate Speech Detection in Greek Social Media
    Perifanos, Konstantinos
    Goutsos, Dionysis
    [J]. MULTIMODAL TECHNOLOGIES AND INTERACTION, 2021, 5 (07)