Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT

Cited by: 7

Authors:

Muneer, Amgad [1,2]

Alwadain, Ayed [3]

Ragab, Mohammed Gamal [2]

Alqushaibi, Alawi [2]

Affiliations:

[1] Univ Texas MD Anderson Canc Ctr, Dept Imaging Phys, Houston, TX 77030 USA

[2] Univ Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Malaysia

[3] King Saud Univ, Community Coll, Comp Sci Dept, Riyadh 145111, Saudi Arabia

Source:

INFORMATION | 2023, Vol. 14, No. 8

Keywords:

cyberbullying detection; ensemble learning; stacked; continuous bag of words; word2vec; Twitter; X platform; Facebook; social media; natural language processing;

DOI:

10.3390/info14080467

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

The prevalence of cyberbullying on social media (SM) platforms has become a significant concern for individuals, organizations, and society as a whole. Early detection of cyberbullying on social media, followed by timely intervention, is critical to mitigating its harmful effects. In recent years, ensemble learning has shown promising results for detecting cyberbullying on social media. This paper presents a stacking ensemble learning approach for detecting cyberbullying on Twitter using a combination of deep neural network (DNN) methods. It also introduces BERT-M, a modified BERT model. The dataset used in this study was collected from Twitter and preprocessed to remove irrelevant information. For feature extraction, word2vec with Continuous Bag of Words (CBOW) was used to form the weights of the embedding layer. These features were then fed into a convolution and pooling mechanism, which reduces their dimensionality and captures the position-invariant characteristics of offensive words. The proposed stacked model and BERT-M were validated using well-known model evaluation measures. The stacked model achieved an F1-score of 0.964, a precision of 0.950, and a recall of 0.92, with a reported detection time of 3 min, which surpasses the previously reported accuracy and speed scores for all known NLP detectors of cyberbullying, including standard BERT and BERT-M. The stacking ensemble learning approach achieved an accuracy of 97.4% in detecting cyberbullying on the Twitter dataset and 90.97% on a combined Twitter and Facebook dataset. The results demonstrate the effectiveness of the proposed stacking ensemble learning approach in detecting cyberbullying on SM and highlight the importance of combining multiple models for improved performance.
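
The abstract's remark that convolution followed by pooling captures position-invariant characteristics of offensive words can be illustrated with a small sketch. This is not the authors' architecture: the two-dimensional vectors stand in for learned word2vec CBOW embeddings, the filter weights are set by hand rather than trained, and every name here (`EMBEDDINGS`, `conv1d_relu`, `offensive_score`, and so on) is a hypothetical choice made for illustration only.

```python
# Toy 2-dimensional vectors standing in for word2vec CBOW embeddings.
# The values are invented for illustration; real embeddings are learned.
EMBEDDINGS = {
    "you": [0.1, 0.0],
    "are": [0.0, 0.1],
    "so": [0.1, 0.1],
    "stupid": [1.0, 0.9],  # hypothetical offensive token
    "today": [0.0, 0.2],
}
PAD = [0.0, 0.0]  # unknown tokens map to a zero vector


def embed(tokens):
    """Look up the embedding vector of each token."""
    return [EMBEDDINGS.get(token, PAD) for token in tokens]


def conv1d_relu(sequence, kernel):
    """Slide the kernel over the sequence (no padding) and apply ReLU."""
    width = len(kernel)
    feature_map = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        activation = sum(
            x * w
            for vector, weights in zip(window, kernel)
            for x, w in zip(vector, weights)
        )
        feature_map.append(max(0.0, activation))
    return feature_map


def global_max_pool(feature_map):
    """Keep only the strongest response, discarding where it occurred."""
    return max(feature_map)


def offensive_score(sentence, kernel):
    """Embed, convolve, and pool a whitespace-tokenized sentence."""
    return global_max_pool(conv1d_relu(embed(sentence.split()), kernel))


# Hand-set width-2 filter; a trained model would learn these weights.
KERNEL = [[1.0, 1.0], [1.0, 1.0]]

score_start = offensive_score("so stupid you are today", KERNEL)
score_end = offensive_score("you are today so stupid", KERNEL)
score_clean = offensive_score("you are so nice today", KERNEL)

print(score_start == score_end)   # same pooled response at either position
print(score_clean < score_start)  # sentence without the offensive bigram scores lower
```

Because global max pooling keeps only the strongest filter response, the bigram "so stupid" yields the same pooled value whether it opens or closes the sentence, and the pooled output has a fixed size regardless of sentence length.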

Pages: 20

50 records in total

[1] Social Media Cyberbullying Detection using Machine Learning
Hani, John
Nashaat, Mohamed
Ahmed, Mostafa
Emad, Zeyad
Amer, Eslam
Mohammed, Ammar
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (05) : 703 - 707
[2] Detection of Cyberbullying on Social Media Platforms Using Machine Learning
Ali, Mohammad Usmaan
Lefticaru, Raluca
ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2023, 2024, 1453 : 220 - 233
[3] Multimodal Cyberbullying Detection Using Ensemble Learning
Khanna, Piyush
Mathur, Abhinav
Chandra, Anunay
Kumar, Akshi
ARTIFICIAL INTELLIGENCE AND SUSTAINABLE COMPUTING FOR SMART CITY, AIS2C2 2021, 2021, 1434 : 221 - 229
[4] Ensemble Learning With Tournament Selected Glowworm Swarm Optimization Algorithm for Cyberbullying Detection on Social Media
Daniel, Ravuri
Murthy, T. Satyanarayana
Kumari, Ch. D. V. P.
Lydia, E. Laxmi
Ishak, Mohamad Khairi
Hadjouni, Myriam
Mostafa, Samih M.
IEEE ACCESS, 2023, 11 : 123392 - 123400
[5] Sentiment analysis of imbalanced datasets using BERT and ensemble stacking for deep learning
Habbat, Nassera
Nouri, Hicham
Anoun, Houda
Hassouni, Larbi
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
[6] Automatic detection of cyberbullying behaviour on social media using Stacked Bi-Gru attention with BERT model
Mali, Mohan K. (Mohan.Mali@bharatividyapeeth.edu), 2025, 262
[7] Advancing offensive language detection in Arabic social media: a BERT-based ensemble learning approach
Mazari, Ahmed Cherif
Benterkia, Asmaa
Takdenti, Zineb
SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
[8] Cyberbullying detection: an ensemble learning approach
Roy, Pradeep Kumar
Singh, Ashish
Tripathy, Asis Kumar
Das, Tapan Kumar
INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2022, 25 (03) : 315 - 324
[9] An Enhanced BERT Model for Depression Detection on Social Media Posts
Nareshkumar, R.
Nimala, K.
ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 2, AITA 2023, 2024, 844 : 53 - 64
[10] A Knowledge Enhanced Ensemble Learning Model for Mental Disorder Detection on Social Media
Rao, Guozheng
Peng, Chengxia
Zhang, Li
Wang, Xin
Feng, Zhiyong
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT II, 2020, 12275 : 181 - 192
