Intelligent detection of hate speech in Arabic social network: A machine learning approach

被引:45
|
作者
Aljarah, Ibrahim [1 ]
Habib, Maria [1 ]
Hijazi, Neveen [1 ]
Faris, Hossam [1 ]
Qaddoura, Raneem [2 ]
Hammo, Bassam [1 ]
Abushariah, Mohammad [1 ]
Alfawareh, Mohammad [1 ]
机构
[1] Univ Jordan, Queen Rania Str, Amman 19328, Jordan
[2] Philadelphia Univ, Amman, Jordan
关键词
Hate speech; machine learning; text vectorization; Twitter; SENTIMENT ANALYSIS; TWITTER;
D O I
10.1177/0165551520917651
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, cyber hate speech is increasingly growing, which forms a serious problem worldwide by threatening the cohesion of civil societies. Hate speech relates to using expressions or phrases that are violent, offensive or insulting for a person or a minority of people. In particular, in the Arab region, the number of Arab social media users is growing rapidly, which is accompanied with high increasing rate of cyber hate speech. This drew our attention to aspire healthy online environments that are free of hatred and discrimination. Therefore, this article aims to detect cyber hate speech based on Arabic context over Twitter platform, by applying Natural Language Processing (NLP) techniques, and machine learning methods. The article considers a set of tweets related to racism, journalism, sports orientation, terrorism and Islam. Several types of features and emotions are extracted and arranged in 15 different combinations of data. The processed dataset is experimented using Support Vector Machine (SVM), Naive Bayes (NB), Decision Tree (DT) and Random Forest (RF), in which RF with the feature set of Term Frequency-Inverse Document Frequency (TF-IDF) and profile-related features achieves the best results. Furthermore, a feature importance analysis is conducted based on RF classifier in order to quantify the predictive ability of features in regard to the hate class.
引用
收藏
页码:483 / 501
页数:19
相关论文
共 50 条
  • [41] A Machine Learning Approach to Cyberbullying Detection in Arabic Tweets
    Musleh, Dhiaa
    Rahman, Atta
    Alkherallah, Mohammed Abbas
    Al-Bohassan, Menhal Kamel
    Alawami, Mustafa Mohammed
    Alsebaa, Hayder Ali
    Alnemer, Jawad Ali
    Al-Mutairi, Ghazi Fayez
    Aldossary, May Issa
    Aldowaihi, Dalal A.
    Alhaidari, Fahd
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 1033 - 1054
  • [42] An Intelligent Arabic Model for Recruitment Fraud Detection Using Machine Learning
    Sofy, Mohamed A.
    Khafagy, Mohammed H.
    Badry, Rasha M.
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (01) : 102 - 111
  • [43] ABMM: Arabic BERT-Mini Model for Hate-Speech Detection on Social Media
    Almaliki, Malik
    Almars, Abdulqader M.
    Gad, Ibrahim
    Atlam, El-Sayed
    [J]. ELECTRONICS, 2023, 12 (04)
  • [44] A Federated Approach for Hate Speech Detection
    Gala, Jay
    Gandhi, Deep
    Mehta, Jash
    Talat, Zeerak
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 3248 - 3259
  • [45] A Deep Learning Approach for Automatic Hate Speech Detection in the Saudi Twittersphere
    Alshalan, Raghad
    Al-Khalifa, Hend
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (23): : 1 - 16
  • [46] Deep feature fusion for hate speech detection: a transfer learning approach
    Dwivedy, Vishwajeet
    Roy, Pradeep Kumar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (23) : 36279 - 36301
  • [47] Deep feature fusion for hate speech detection: a transfer learning approach
    Vishwajeet Dwivedy
    Pradeep Kumar Roy
    [J]. Multimedia Tools and Applications, 2023, 82 : 36279 - 36301
  • [48] Detecting Hate Speech in Arabic Tweets During COVID-19 Using Machine Learning Approaches
    Alhejaili, Ruba
    Alsaeedi, Abdullah
    Yafooz, Wael M. S.
    [J]. PROCEEDINGS OF THIRD DOCTORAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE, DOSCI 2022, 2023, 479 : 467 - 475
  • [49] Detection of Arabic Cyberbullying on Social Networks Using Machine Learning
    Mouheb, Djedjiga
    Albarghash, Raghad
    Mowakeh, Mohamad Fouzi
    Al Aghbari, Zaher
    Kamel, Ibrahim
    [J]. 2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [50] A survey on hate speech detection and sentiment analysis using machine learning and deep learning models
    Subramanian, Malliga
    Sathiskumar, Veerappampalayam Easwaramoorthy
    Deepalakshmi, G.
    Cho, Jaehyuk
    Manikandan, G.
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2023, 80 : 110 - 121