Emotionally Informed Hate Speech Detection: A Multi-target Perspective

Cited by: 27
Authors
Chiril, Patricia [1]
Pamungkas, Endang Wahyu [2]
Benamara, Farah [1]
Moriceau, Veronique [1]
Patti, Viviana [2]
Affiliations
[1] Univ Toulouse III UPS, Univ Toulouse, IRIT, Toulouse, France
[2] Univ Turin, Dipartimento Informat, Turin, Italy
Keywords
Hate speech detection; Hate speech targets; Affective resources; Multi-task learning; Social media; SENTIMENT ANALYSIS; CYBER HATE; TWITTER; CONTEXT; SENTICNET; LANGUAGE; MODEL
DOI
10.1007/s12559-021-09862-5
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Hate speech and harassment are widespread in online communication, due to users' freedom and anonymity and the lack of regulation provided by social media platforms. Hate speech is topically focused (misogyny, sexism, racism, xenophobia, homophobia, etc.), and each specific manifestation of hate speech targets different vulnerable groups based on characteristics such as gender (misogyny, sexism), ethnicity, race, religion (xenophobia, racism, Islamophobia), sexual orientation (homophobia), and so on. Most automatic hate speech detection approaches cast the problem as a binary classification task without addressing either the topical focus or the target-oriented nature of hate speech. In this paper, we propose to tackle, for the first time, hate speech detection from a multi-target perspective. We leverage manually annotated datasets to investigate the problem of transferring knowledge across datasets with different topical focuses and targets. Our contribution is threefold: (1) we explore the ability of hate speech detection models to capture common properties from topic-generic datasets and transfer this knowledge to recognize specific manifestations of hate speech; (2) we experiment with the development of models to detect both topics (racism, xenophobia, sexism, misogyny) and hate speech targets, going beyond standard binary classification, to investigate how to detect hate speech at a finer level of granularity and how to transfer knowledge across different topics and targets; and (3) we study the impact of affective knowledge encoded in sentic computing resources (SenticNet, EmoSenticNet) and in semantically structured hate lexicons (HurtLex) in determining specific manifestations of hate speech. We experimented with different neural models, including multi-task approaches.
Our study shows that: (1) training a model on a combination of training sets from several topic-specific datasets is more effective than training a model on a topic-generic dataset; (2) the multi-task approach outperforms a single-task model when detecting both the hatefulness of a tweet and its topical focus in the context of a multi-label classification approach; and (3) the models incorporating EmoSenticNet emotions, the first-level emotions of SenticNet, a blend of SenticNet and EmoSenticNet emotions, or affective features based on HurtLex obtained the best results. Our results demonstrate that multi-target hate speech detection from existing datasets is feasible, which is a first step towards hate speech detection for a specific topic/target when dedicated annotated data are missing. Moreover, we show that domain-independent affective knowledge, injected into our models, helps finer-grained hate speech detection.
Pages: 322-352
Page count: 31
Source
Cognitive Computation, 2022, 14: 322-352