Detecting hate speech against politicians in Arabic community on social media

Cited by: 13
Authors
Guellil, Imane [1, 2, 3]
Adeel, Ahsan [4]
Azouaou, Faical [1]
Chennoufi, Sara [1]
Maafi, Hanene [1]
Hamitouche, Thinhinane [1]
Affiliations
[1] Ecole Natl Super Informat, Lab Methodes Concept Syst, Algiers, Algeria
[2] Aston Univ, Sch Engn & Appl Sci EAS, Birmingham, W Midlands, England
[3] Folding Space, Birmingham, W Midlands, England
[4] Univ Wolverhampton, Sch Math & Comp Sci, Wolverhampton, England
Keywords
Arabic hate speech; COMMUNICATION;
DOI
10.1108/IJWIS-08-2019-0036
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Purpose: This paper proposes an approach for detecting hate speech against politicians in the Arabic community on social media (e.g. YouTube). Similar work has been presented in the literature for other languages such as English. However, to the best of the authors' knowledge, little work has been conducted on the Arabic language.
Design/methodology/approach: The approach uses both classical classification algorithms and deep learning algorithms. For the classical algorithms, the authors use Gaussian NB (GNB), Logistic Regression (LR), Random Forest (RF), SGD Classifier (SGD) and Linear SVC (LSVC). For deep learning classification, four algorithms are applied: convolutional neural network (CNN), multilayer perceptron (MLP), long short-term memory (LSTM) and bi-directional long short-term memory (Bi-LSTM). For feature extraction, the authors use both Word2vec and FastText with their two implementations, namely Skip Gram (SG) and Continuous Bag of Words (CBOW).
Findings: Simulation results show that LSVC, Bi-LSTM and MLP perform best, achieving an accuracy of up to 91% when combined with the SG model. The results also show that classification on a balanced corpus is more accurate than classification on an unbalanced corpus.
Originality/value: The principal originality of this paper is the construction of a new hate speech corpus (Arabic_fr_en), which was annotated by three different annotators. This corpus contains the three languages used by Arabic people: Arabic, French and English. For Arabic, the corpus contains both Arabic script and Arabizi (i.e. Arabic words written with Latin letters). A further contribution is the use of both shallow and deep learning classification with different feature extraction models, namely Word2vec and FastText, each with its two implementations, SG and CBOW.
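The abstract does not say how word embeddings are turned into comment-level features, nor how the balanced corpus was obtained. The following Python sketch shows one common way to do each, under stated assumptions: averaging the word vectors of a comment, and random undersampling to the size of the rarest class. The function names `average_vector` and `undersample`, the toy embeddings and the toy labels are all hypothetical and are not taken from the paper.

```python
import random
from collections import defaultdict


def average_vector(tokens, embeddings, dim):
    """Mean of the vectors of known tokens; a zero vector if none is known.

    Assumption: comment features are the average of word vectors. The
    abstract only names Word2vec/FastText (SG and CBOW), not the pooling.
    """
    known = [embeddings[t] for t in tokens if t in embeddings]
    if not known:
        return [0.0] * dim
    return [sum(column) / len(known) for column in zip(*known)]


def undersample(samples, seed=0):
    """Keep, for every label, as many items as the rarest label has.

    Assumption: balancing by random undersampling. The abstract does not
    state how the balanced corpus was built.
    """
    by_label = defaultdict(list)
    for text, label in samples:
        by_label[label].append((text, label))
    smallest = min(len(items) for items in by_label.values())
    rng = random.Random(seed)
    balanced = []
    for items in by_label.values():
        balanced.extend(rng.sample(items, smallest))
    return balanced


# Toy 2-dimensional embeddings with placeholder values.
toy_embeddings = {"good": [1.0, 0.0], "bad": [0.0, 1.0]}
features = average_vector(["good", "bad", "unknown"], toy_embeddings, dim=2)

# Toy unbalanced corpus: one hateful comment, three non-hateful ones.
toy_corpus = [
    ("c1", "hateful"),
    ("c2", "non-hateful"),
    ("c3", "non-hateful"),
    ("c4", "non-hateful"),
]
balanced_corpus = undersample(toy_corpus)
```

In a fuller pipeline, the averaged vectors would be passed to one of the classifiers named in the abstract (for example a linear SVC), and the comparison between balanced and unbalanced corpora would be made by training once on `toy_corpus` and once on `balanced_corpus`.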
Pages: 295-313
Page count: 19