Unsupervised fine-grained hate speech target community detection and characterisation on social media

被引:2
|
作者
Ollagnier, Anais [1 ]
Cabrio, Elena [1 ]
Villata, Serena [1 ]
机构
[1] Univ Cote Azur, Inria, CNRS, I3S, 930 Route Colles, F-06903 Sophia Antipolis, France
关键词
Fine-grained hate speech target detection; Community detection; Multi-view clustering; Sentence embedding; Social media; MULTIVIEW;
D O I
10.1007/s13278-023-01061-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent studies have highlighted the importance to reach a fine-grained online hate speech characterisation to better understand how hate is conveyed, especially on social media. A key element in this scenario is the identification and characterisation of the hate speech target community, e.g. national, ethnic, and religious minorities. In this paper, we propose a full pipeline relying on unsupervised methods to distinguish specific hate speech manifestations, i.e. targeted (group of) victim(s) and the protected characteristics (target-types) discriminated. Our contribution is threefold: (1) we leverage multiple data views to contrast different abusive behaviours; (2) we explore the use of clustering techniques to perform fine-grained hate speech target community detection, and (3) we address an in-depth content analysis of the generated hate speech target communities. Relying on multiple data views derived from multilingual pre-trained language models (i.e. multilingual BERT and multilingual Universal Sentence Encoder) and the Multi-view Spectral Clustering (MvSC) algorithm, the 69 experiments performed on the Multilingual Hate Speech dataset (MLMA) of tweets show that most of the configurations of the proposed pipeline significantly outperform state-of-the-art clustering algorithms on French and English. Our experiments confirm the ability of the proposed approach to capture complex hate speech phenomena (i.e. intersections between victim-groups, target-types or both).
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Unsupervised fine-grained hate speech target community detection and characterisation on social media
    Anaïs Ollagnier
    Elena Cabrio
    Serena Villata
    Social Network Analysis and Mining, 13
  • [2] Fine-Grained Emotions Influence on Implicit Hate Speech Detection
    Jafari, Amir Reza
    Li, Guanlin
    Rajapaksha, Praboda
    Farahbakhsh, Reza
    Crespi, Noel
    IEEE ACCESS, 2023, 11 : 105330 - 105343
  • [3] A Fine-Grained Taxonomy of Replies to Hate Speech
    Yu, Xinchen
    Zhao, Ashley
    Blanco, Eduardo
    Hong, Lingzi
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 7275 - 7289
  • [4] Hierarchical CVAE for Fine-Grained Hate Speech Classification
    Qian, Jing
    ElSherief, Mai
    Belding, Elizabeth
    Wang, William Yang
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3550 - 3559
  • [5] Fine-Grained Multilingual Hate Speech Detection Using Explainable AI and Transformers
    Siddiqui, Jawaid Ahmed
    Yuhaniz, Siti Sophiayati
    Shaikh, Ghulam Mujtaba
    Soomro, Safdar Ali
    Mahar, Zafar Ali
    IEEE ACCESS, 2024, 12 : 143177 - 143192
  • [6] Fine-Grained Prediction of Political Leaning on Social Media with Unsupervised Deep Learning
    Fagni T.
    Cresci S.
    Journal of Artificial Intelligence Research, 2022, 73 : 633 - 672
  • [7] Fine-Grained Prediction of Political Leaning on Social Media with Unsupervised Deep Learning
    Fagni, Tiziano
    Cresci, Stefano
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 73 : 633 - 672
  • [8] Hybrid Approaches to Fine-Grained Emotion Detection in Social Media Data
    Schoene, Annika Marie
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13732 - 13733
  • [9] Vulnerable community identification using hate speech detection on social media
    Mossie, Zewdie
    Wang, Jenq-Haur
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (03)
  • [10] Fine-grained German Sentiment Analysis on Social Media
    Momtazi, Saeedeh
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1215 - 1220