Automatic detection of cyberbullying and threatening in Saudi tweets using machine learning

被引：0

作者：

Alghamdi, Deema ^{[1
]}

Al-Motery, Rahaf ^{[1
]}

Alma'abdi, Reem ^{[1
]}

Alzamzami, Ohoud ^{[1
]}

Babour, Amal ^{[1
]}

机构：

[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia

来源：

INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES | 2021年 / 8卷 / 10期

关键词：

Artificial intelligence; Arabic language; Cyberbullying; Text classification; Machine learning;

D O I：

10.21833/ijaas.2021.10.003

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Social media has become a major factor in people's lives, which affects their communication and psychological state. The widespread use of social media has formed new types of violence, such as cyberbullying. Manual detection and reporting of violent texts in social media applications are challenging due to the increasing number of social media users and the huge amounts of generated data. Automatic detection of violent texts is language-dependent, and it requires an efficient detection approach, which considers the unique features and structures of a specific language or dialect. Only a few studies have focused on the automatic detection and classification of violent texts in the Arabic Language. This paper aims to build a two-level classifier model for classifying Arabic violent texts. The first level classifies text into violent and non-violent. The second level classifies violent text into either cyberbullying or threatening. The dataset used to build the classifier models is collected from Twitter, using specific keywords and trending hashtags in Saudi Arabia. Supervised machine learning is used to build two classifier models, using two different algorithms, which are Support Vector Machine (SVM), and Naive Bayes (NB). Both models are trained in different experimental settings of varying the feature extraction method and whether stop-word removal is applied or not. The performances of the proposed SVM-based and NB-based models have been compared. The SVM-based model outperforms the NBbased model with F1 scores of 76.06%, and 89.18%, and accuracy scores of 73.35% and 87.79% for the first and second levels of classification, respectively. (c) 2021 The Authors. Published by IASE. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

引用

页码：17 / 25

页数：9

共 50 条

[1] A Machine Learning Approach to Cyberbullying Detection in Arabic Tweets
Musleh, Dhiaa
Rahman, Atta
Alkherallah, Mohammed Abbas
Al-Bohassan, Menhal Kamel
Alawami, Mustafa Mohammed
Alsebaa, Hayder Ali
Alnemer, Jawad Ali
Al-Mutairi, Ghazi Fayez
Aldossary, May Issa
Aldowaihi, Dalal A.
Alhaidari, Fahd
[J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 1033 - 1054
[2] Threatening URDU Language Detection from Tweets Using Machine Learning
Mehmood, Aneela
Farooq, Muhammad Shoaib
Naseem, Ansar
Rustam, Furqan
Gracia Villar, Monica
Lili Rodriguez, Carmen
Ashraf, Imran
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (20):
[3] Detecting Arabic Cyberbullying Tweets Using Machine Learning
Alduailaj, Alanoud Mohammed
Belghith, Aymen
[J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (01): : 29 - 42
[4] Cyberbullying detection from tweets using deep learning
Bharti, Shubham
Yadav, Arun Kumar
Kumar, Mohit
Yadav, Divakar
[J]. KYBERNETES, 2022, 51 (09) : 2695 - 2711
[5] ArCyb: A Robust Machine-Learning Model for Arabic Cyberbullying Tweets in Saudi Arabia
Mursi, Khalid T.
Almalki, Abdulrahman Y.
Alshangiti, Moayad M.
Alsubaei, Faisal S.
Alghamdi, Ahmed A.
[J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 1059 - 1067
[6] ArCyb: A Robust Machine-Learning Model for Arabic Cyberbullying Tweets in Saudi Arabia
Mursi, Khalid T.
Almalki, Abdulrahman Y.
Alshangiti, Moayad M.
Alsubaei, Faisal S.
Alghamdi, Ahmed A.
[J]. International Journal of Advanced Computer Science and Applications, 2023, 14 (09): : 1059 - 1067
[7] Cyberbullying Detection using Machine Learning and Deep Learning
Alabdulwahab, Aljwharah
Haq, Mohd Anul
Alshehri, Mohammed
[J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 424 - 432
[8] Detection of Hate Tweets using Machine Learning and Deep Learning
Ketsbaia, Lida
Issac, Biju
Chen, Xiaomin
[J]. 2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 751 - 758
[9] Social Media Cyberbullying Detection using Machine Learning
Hani, John
Nashaat, Mohamed
Ahmed, Mostafa
Emad, Zeyad
Amer, Eslam
Mohammed, Ammar
[J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (05) : 703 - 707
[10] Cyberbullying Detection for Urdu Language Using Machine Learning
Mustafa, Hamza
Zafar, Kashif
[J]. FORTHCOMING NETWORKS AND SUSTAINABILITY IN THE AIOT ERA, VOL 1, FONES-AIOT 2024, 2024, 1035 : 244 - 257

← 1 2 3 4 5 →