Ensemble-Based Text Classification for Spam Detection

被引:0
|
作者
Zhang X. [1 ]
Liu G. [1 ]
Zhang M. [1 ]
机构
[1] School of Information Engineering, Tangshan Polytechnic College, Hebei, Tangshan
来源
Informatica (Slovenia) | 2024年 / 48卷 / 06期
关键词
classifier selection; ensemble-based; feature extraction; spam detection; text classification;
D O I
10.31449/inf.v48i6.5246
中图分类号
学科分类号
摘要
This research proposes an ensemble-based approach for spam detection in digital communication, addressing the escalating challenge posed by unsolicited messages, commonly known as spam. The exponential growth of online platforms has necessitated the development of effective information filtering systems to maintain security and efficiency. The proposed approach involves three main components: feature extraction, classifier selection, and decision fusion. The feature extraction techniques are word embedding, are explored to represent text messages effectively. Multiple classifiers, including RNN including LSTM and GRU are evaluated to identify the best performers for spam detection. By employing the ensemble model combines the strengths of individual classifiers to achieve higher accuracy, precision, and recall. The evaluation of the proposed approach utilizes widely accepted metrics on benchmark datasets, ensuring its generalizability and robustness. The experimental results demonstrate that the ensemble-based approach outperforms individual classifiers, offering an efficient solution for combatting spam messages. Integration of this approach into existing spam filtering systems can contribute to improved online communication, user experience, and enhanced cybersecurity, effectively mitigating the impact of spam in the digital landscape. © 2024 Slovene Society Informatika. All rights reserved.
引用
收藏
页码:71 / 80
页数:9
相关论文
共 50 条
  • [31] Deep convolutional forest: a dynamic deep ensemble approach for spam detection in text
    Mai A. Shaaban
    Yasser F. Hassan
    Shawkat K. Guirguis
    Complex & Intelligent Systems, 2022, 8 : 4897 - 4909
  • [32] Enhancing Spam Message Classification and Detection Using Transformer-Based Embedding and Ensemble Learning
    Ghourabi, Abdallah
    Alohaly, Manar
    SENSORS, 2023, 23 (08)
  • [33] Expanding analytical capabilities in intrusion detection through ensemble-based multi-label classification
    Hallaji, Ehsan
    Razavi-Far, Roozbeh
    Saif, Mehrdad
    COMPUTERS & SECURITY, 2024, 139
  • [34] Ensemble-Based Algorithm for Synchrophasor Data Anomaly Detection
    Zhou, Mengze
    Wang, Yuhui
    Srivastava, Anurag K.
    Wu, Yinghui
    Banerjee, P.
    IEEE TRANSACTIONS ON SMART GRID, 2019, 10 (03) : 2979 - 2988
  • [35] Ensemble-based exudate detection in color fundus Images
    Nagy, Brigitta
    Antal, Balint
    Harangi, Balazs
    Hajdu, Andras
    PROCEEDINGS OF THE 7TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2011), 2011, : 700 - 703
  • [36] Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks
    Barushka, Aliaksandr
    Hajek, Petr
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (09): : 4239 - 4257
  • [37] Ensemble-Based Feature Ranking for Semi-supervised Classification
    Petkovic, Matej
    Dzeroski, Saso
    Kocev, Dragi
    DISCOVERY SCIENCE (DS 2019), 2019, 11828 : 290 - 305
  • [38] Clustering ensemble-based novelty score for outlier detection
    Yu, Jaehong
    Kang, Jihoon
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
  • [39] Detection of financial information manipulation by an ensemble-based mechanism
    Lin, Sin-Jin (annman1204@gmail.com), 1600, Institute of Computer Science Izhevsk (24):
  • [40] DETECTION OF FINANCIAL INFORMATION MANIPULATION BY AN ENSEMBLE-BASED MECHANISM
    Shih, Ching-Hui
    Lin, Sin-Jin
    Hsu, Ming-Fu
    NEURAL NETWORK WORLD, 2014, 24 (05) : 479 - 499