Improving detection accuracy of politically motivated cyber-hate using heterogeneous stacked ensemble (HSE) approach

被引:3
|
作者
Mullah, Nanlir Sallau [1 ,2 ]
Zainon, Wan Mohd Nazmee Wan [1 ]
机构
[1] Univ Sains Malaysia, Sch Comp Sci, George Town 11800, Malaysia
[2] Fed Coll Educ Pankshin, PMB1027, Pankshin, Plateau State, Nigeria
关键词
Text categorization; Stacking ensemble; Machine learning; Hate speech; Social media platforms; Political discourse; TF-IDF; MACHINE; SPEECH;
D O I
10.1007/s12652-022-03763-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The surge in cyber-hate crimes is largely fuelled by the popularization of social media platforms. On that note, cyber-hate has become an increasing concern for most countries, especially those that are practising democracy. Studies on the influence of social media (SM) on political discourse have now become an important research area due to the rising trends of SM politics. It becomes necessary to address this problem using automated social intelligence. To tackle this concern, the researchers built a novel heterogeneous stacked ensemble (HSE) classifier for detecting politically motivated cyber-hate on Twitter. We constructed a heterogeneous stacked ensemble with eight baseline estimators. In the proposed methodology, the researchers employed TF-IDF for feature vectorisation. The researchers used Twitter API for data scraping to harvest tweets during a gubernatorial election in Nigeria for the training and evaluation of the stacked ensemble model. A total of 15,502 tweets were collected and after some preliminary cleaning, 5876 tweets were manually labelled as hate (1) or non-hate (0). The coded tweets contain 16.87% hate and 83.13% non-hate tweets. This article has three contributions - a critical review of literature on the detection of politically motivated cyber-hate, the building of a new dataset and the proposed stacked ensemble method. Two other public datasets (Kaggle and HASOC) were used to test the performance of our method. The F1-score metric was employed for comparison. Our method is better by 12% on the Kaggle and 4% on the HASOC datasets. We are working on more data for deep learning experiments.
引用
收藏
页码:12179 / 12190
页数:12
相关论文
共 19 条
  • [1] Improving detection accuracy of politically motivated cyber-hate using heterogeneous stacked ensemble (HSE) approach
    Nanlir Sallau Mullah
    Wan Mohd Nazmee Wan Zainon
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 12179 - 12190
  • [2] A Multi-Stage Machine Learning and Fuzzy Approach to Cyber-Hate Detection
    Ketsbaia, Lida
    Issac, Biju
    Chen, Xiaomin
    Jacob, Seibu Mary
    IEEE ACCESS, 2023, 11 : 56046 - 56065
  • [3] Improving spam email classification accuracy using ensemble techniques: a stacking approach
    Adnan, Muhammad
    Imam, Muhammad Osama
    Javed, Muhammad Furqan
    Murtza, Iqbal
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2024, 23 (01) : 505 - 517
  • [4] Improving spam email classification accuracy using ensemble techniques: a stacking approach
    Muhammad Adnan
    Muhammad Osama Imam
    Muhammad Furqan Javed
    Iqbal Murtza
    International Journal of Information Security, 2024, 23 : 505 - 517
  • [5] Improving protein-protein interactions prediction accuracy using XGBoost feature selection and stacked ensemble classifier
    Chen, Cheng
    Zhang, Qingmei
    Yu, Bin
    Yu, Zhaomin
    Lawrence, Patrick J.
    Ma, Qin
    Zhang, Yan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 123
  • [6] Improving Hate Speech Detection Accuracy using Hybrid CNN-RNN and Random Oversampling Techniques
    Riyadi, Slamet
    Andriyani, Annisa Divayu
    Masyhur, Ahmad Musthafa
    2024 IEEE SYMPOSIUM ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ISIEA 2024, 2024,
  • [7] Automatic hate speech detection using killer natural language processing optimizing ensemble deep learning approach
    Al-Makhadmeh, Zafer
    Tolba, Amr
    COMPUTING, 2020, 102 (02) : 501 - 522
  • [8] A Novel Approach for Structural Damage Detection Using Multi-Headed Stacked Deep Ensemble Learning
    Asghari, Arghavan
    Amiri, Gholamreza Ghodrati
    Darvishan, Ehsan
    Asghari, Arian
    JOURNAL OF VIBRATION ENGINEERING & TECHNOLOGIES, 2024, 12 (03) : 4209 - 4224
  • [9] A Novel Approach for Structural Damage Detection Using Multi-Headed Stacked Deep Ensemble Learning
    Arghavan Asghari
    Gholamreza Ghodrati Amiri
    Ehsan Darvishan
    Arian Asghari
    Journal of Vibration Engineering & Technologies, 2024, 12 : 4209 - 4224
  • [10] Automatic hate speech detection using killer natural language processing optimizing ensemble deep learning approach
    Zafer Al-Makhadmeh
    Amr Tolba
    Computing, 2020, 102 : 501 - 522