Unsupervised fine-grained hate speech target community detection and characterisation on social media

Cited by: 2

Authors:

Ollagnier, Anais [1]

Cabrio, Elena [1]

Villata, Serena [1]

Affiliations:

[1] Univ Cote Azur, Inria, CNRS, I3S, 930 Route Colles, F-06903 Sophia Antipolis, France

Source:

SOCIAL NETWORK ANALYSIS AND MINING | 2023 / Volume 13 / Issue 01

Keywords:

Fine-grained hate speech target detection; Community detection; Multi-view clustering; Sentence embedding; Social media; MULTIVIEW;

DOI:

10.1007/s13278-023-01061-4

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Recent studies have highlighted the importance of reaching a fine-grained characterisation of online hate speech to better understand how hate is conveyed, especially on social media. A key element in this scenario is the identification and characterisation of the hate speech target community, e.g. national, ethnic, and religious minorities. In this paper, we propose a full pipeline relying on unsupervised methods to distinguish specific hate speech manifestations, i.e. the targeted (group of) victim(s) and the protected characteristics (target-types) discriminated against. Our contribution is threefold: (1) we leverage multiple data views to contrast different abusive behaviours; (2) we explore the use of clustering techniques to perform fine-grained hate speech target community detection; and (3) we conduct an in-depth content analysis of the generated hate speech target communities. Relying on multiple data views derived from multilingual pre-trained language models (i.e. multilingual BERT and multilingual Universal Sentence Encoder) and the Multi-view Spectral Clustering (MvSC) algorithm, the 69 experiments performed on the Multilingual Hate Speech dataset (MLMA) of tweets show that most of the configurations of the proposed pipeline significantly outperform state-of-the-art clustering algorithms on French and English. Our experiments confirm the ability of the proposed approach to capture complex hate speech phenomena (i.e. intersections between victim-groups, target-types or both).
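
The general idea of clustering texts from several embedding views can be illustrated with a minimal sketch. It rests on loud assumptions: random synthetic vectors stand in for the multilingual BERT and multilingual Universal Sentence Encoder embeddings, the views are fused by simply averaging their normalised affinity matrices (a basic multi-view baseline, not the MvSC algorithm used in the paper), and every function name below is hypothetical.

```python
import numpy as np


def cosine_affinity(X):
    """Non-negative cosine-similarity affinity matrix for one view."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    W = np.clip(Xn @ Xn.T, 0.0, None)  # negative similarities become 0
    np.fill_diagonal(W, 0.0)           # no self-loops
    return W


def normalise(W):
    """Symmetric normalisation D^{-1/2} W D^{-1/2}."""
    d_inv_sqrt = 1.0 / np.sqrt(np.maximum(W.sum(axis=1), 1e-12))
    return W * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def kmeans(X, k, n_iter=50):
    """Lloyd's k-means with deterministic farthest-point initialisation."""
    centres = [X[0]]
    for _ in range(k - 1):
        dist = np.min([((X - c) ** 2).sum(axis=1) for c in centres], axis=0)
        centres.append(X[np.argmax(dist)])
    centres = np.array(centres)
    for _ in range(n_iter):
        sq_dist = ((X[:, None, :] - centres[None]) ** 2).sum(axis=2)
        labels = np.argmin(sq_dist, axis=1)
        for j in range(k):
            if np.any(labels == j):
                centres[j] = X[labels == j].mean(axis=0)
    return labels


def multiview_spectral_clustering(views, k):
    """Cluster samples described by several feature views (assumed baseline)."""
    # Fuse the views by averaging their normalised affinities.
    S = np.mean([normalise(cosine_affinity(V)) for V in views], axis=0)
    # Spectral embedding: eigenvectors of the k largest eigenvalues.
    _, vecs = np.linalg.eigh(S)
    U = vecs[:, -k:]
    U = U / np.maximum(np.linalg.norm(U, axis=1, keepdims=True), 1e-12)
    return kmeans(U, k)


def make_view(truth, dim, rng, noise=0.3):
    """Synthetic stand-in for one embedding view: 3 noisy, well-separated groups."""
    centres = 5.0 * np.eye(dim)[:3]
    return centres[truth] + noise * rng.normal(size=(len(truth), dim))


rng = np.random.default_rng(0)
truth = np.repeat(np.arange(3), 10)   # 30 synthetic "texts", 3 hidden groups
view_a = make_view(truth, 16, rng)    # placeholder for one encoder's output
view_b = make_view(truth, 8, rng)     # placeholder for a second encoder
labels = multiview_spectral_clustering([view_a, view_b], k=3)
print(labels)
```

In a real setting, `view_a` and `view_b` would be replaced by sentence embeddings of the same texts from two different encoders, and the number of clusters `k` would have to be chosen or estimated separately.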


Pages: 23

50 records in total

[1] Unsupervised fine-grained hate speech target community detection and characterisation on social media
Anaïs Ollagnier
Elena Cabrio
Serena Villata
Social Network Analysis and Mining, 13
[2] Fine-Grained Emotions Influence on Implicit Hate Speech Detection
Jafari, Amir Reza
Li, Guanlin
Rajapaksha, Praboda
Farahbakhsh, Reza
Crespi, Noel
IEEE ACCESS, 2023, 11 : 105330 - 105343
[3] A Fine-Grained Taxonomy of Replies to Hate Speech
Yu, Xinchen
Zhao, Ashley
Blanco, Eduardo
Hong, Lingzi
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 7275 - 7289
[4] Hierarchical CVAE for Fine-Grained Hate Speech Classification
Qian, Jing
ElSherief, Mai
Belding, Elizabeth
Wang, William Yang
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3550 - 3559
[5] Fine-Grained Multilingual Hate Speech Detection Using Explainable AI and Transformers
Siddiqui, Jawaid Ahmed
Yuhaniz, Siti Sophiayati
Shaikh, Ghulam Mujtaba
Soomro, Safdar Ali
Mahar, Zafar Ali
IEEE ACCESS, 2024, 12 : 143177 - 143192
[6] Fine-Grained Prediction of Political Leaning on Social Media with Unsupervised Deep Learning
Fagni T.
Cresci S.
Journal of Artificial Intelligence Research, 2022, 73 : 633 - 672
[7] Fine-Grained Prediction of Political Leaning on Social Media with Unsupervised Deep Learning
Fagni, Tiziano
Cresci, Stefano
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 73 : 633 - 672
[8] Hybrid Approaches to Fine-Grained Emotion Detection in Social Media Data
Schoene, Annika Marie
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13732 - 13733
[9] Vulnerable community identification using hate speech detection on social media
Mossie, Zewdie
Wang, Jenq-Haur
INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (03)
[10] Fine-grained German Sentiment Analysis on Social Media
Momtazi, Saeedeh
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1215 - 1220
