Automatic detection of cyberbullying and threatening in Saudi tweets using machine learning

被引:0
|
作者
Alghamdi, Deema [1 ]
Al-Motery, Rahaf [1 ]
Alma'abdi, Reem [1 ]
Alzamzami, Ohoud [1 ]
Babour, Amal [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
关键词
Artificial intelligence; Arabic language; Cyberbullying; Text classification; Machine learning;
D O I
10.21833/ijaas.2021.10.003
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Social media has become a major factor in people's lives, which affects their communication and psychological state. The widespread use of social media has formed new types of violence, such as cyberbullying. Manual detection and reporting of violent texts in social media applications are challenging due to the increasing number of social media users and the huge amounts of generated data. Automatic detection of violent texts is language-dependent, and it requires an efficient detection approach, which considers the unique features and structures of a specific language or dialect. Only a few studies have focused on the automatic detection and classification of violent texts in the Arabic Language. This paper aims to build a two-level classifier model for classifying Arabic violent texts. The first level classifies text into violent and non-violent. The second level classifies violent text into either cyberbullying or threatening. The dataset used to build the classifier models is collected from Twitter, using specific keywords and trending hashtags in Saudi Arabia. Supervised machine learning is used to build two classifier models, using two different algorithms, which are Support Vector Machine (SVM), and Naive Bayes (NB). Both models are trained in different experimental settings of varying the feature extraction method and whether stop-word removal is applied or not. The performances of the proposed SVM-based and NB-based models have been compared. The SVM-based model outperforms the NBbased model with F1 scores of 76.06%, and 89.18%, and accuracy scores of 73.35% and 87.79% for the first and second levels of classification, respectively. (c) 2021 The Authors. Published by IASE. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:17 / 25
页数:9
相关论文
共 50 条
  • [1] A Machine Learning Approach to Cyberbullying Detection in Arabic Tweets
    Musleh, Dhiaa
    Rahman, Atta
    Alkherallah, Mohammed Abbas
    Al-Bohassan, Menhal Kamel
    Alawami, Mustafa Mohammed
    Alsebaa, Hayder Ali
    Alnemer, Jawad Ali
    Al-Mutairi, Ghazi Fayez
    Aldossary, May Issa
    Aldowaihi, Dalal A.
    Alhaidari, Fahd
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 1033 - 1054
  • [2] Threatening URDU Language Detection from Tweets Using Machine Learning
    Mehmood, Aneela
    Farooq, Muhammad Shoaib
    Naseem, Ansar
    Rustam, Furqan
    Gracia Villar, Monica
    Lili Rodriguez, Carmen
    Ashraf, Imran
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [3] Detecting Arabic Cyberbullying Tweets Using Machine Learning
    Alduailaj, Alanoud Mohammed
    Belghith, Aymen
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (01): : 29 - 42
  • [4] Cyberbullying detection from tweets using deep learning
    Bharti, Shubham
    Yadav, Arun Kumar
    Kumar, Mohit
    Yadav, Divakar
    [J]. KYBERNETES, 2022, 51 (09) : 2695 - 2711
  • [5] ArCyb: A Robust Machine-Learning Model for Arabic Cyberbullying Tweets in Saudi Arabia
    Mursi, Khalid T.
    Almalki, Abdulrahman Y.
    Alshangiti, Moayad M.
    Alsubaei, Faisal S.
    Alghamdi, Ahmed A.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 1059 - 1067
  • [6] ArCyb: A Robust Machine-Learning Model for Arabic Cyberbullying Tweets in Saudi Arabia
    Mursi, Khalid T.
    Almalki, Abdulrahman Y.
    Alshangiti, Moayad M.
    Alsubaei, Faisal S.
    Alghamdi, Ahmed A.
    [J]. International Journal of Advanced Computer Science and Applications, 2023, 14 (09): : 1059 - 1067
  • [7] Cyberbullying Detection using Machine Learning and Deep Learning
    Alabdulwahab, Aljwharah
    Haq, Mohd Anul
    Alshehri, Mohammed
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 424 - 432
  • [8] Detection of Hate Tweets using Machine Learning and Deep Learning
    Ketsbaia, Lida
    Issac, Biju
    Chen, Xiaomin
    [J]. 2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 751 - 758
  • [9] Social Media Cyberbullying Detection using Machine Learning
    Hani, John
    Nashaat, Mohamed
    Ahmed, Mostafa
    Emad, Zeyad
    Amer, Eslam
    Mohammed, Ammar
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (05) : 703 - 707
  • [10] Cyberbullying Detection for Urdu Language Using Machine Learning
    Mustafa, Hamza
    Zafar, Kashif
    [J]. FORTHCOMING NETWORKS AND SUSTAINABILITY IN THE AIOT ERA, VOL 1, FONES-AIOT 2024, 2024, 1035 : 244 - 257