Ensemble-Based Text Classification for Spam Detection

被引:0
|
作者
Zhang X. [1 ]
Liu G. [1 ]
Zhang M. [1 ]
机构
[1] School of Information Engineering, Tangshan Polytechnic College, Hebei, Tangshan
来源
Informatica (Slovenia) | 2024年 / 48卷 / 06期
关键词
classifier selection; ensemble-based; feature extraction; spam detection; text classification;
D O I
10.31449/inf.v48i6.5246
中图分类号
学科分类号
摘要
This research proposes an ensemble-based approach for spam detection in digital communication, addressing the escalating challenge posed by unsolicited messages, commonly known as spam. The exponential growth of online platforms has necessitated the development of effective information filtering systems to maintain security and efficiency. The proposed approach involves three main components: feature extraction, classifier selection, and decision fusion. The feature extraction techniques are word embedding, are explored to represent text messages effectively. Multiple classifiers, including RNN including LSTM and GRU are evaluated to identify the best performers for spam detection. By employing the ensemble model combines the strengths of individual classifiers to achieve higher accuracy, precision, and recall. The evaluation of the proposed approach utilizes widely accepted metrics on benchmark datasets, ensuring its generalizability and robustness. The experimental results demonstrate that the ensemble-based approach outperforms individual classifiers, offering an efficient solution for combatting spam messages. Integration of this approach into existing spam filtering systems can contribute to improved online communication, user experience, and enhanced cybersecurity, effectively mitigating the impact of spam in the digital landscape. © 2024 Slovene Society Informatika. All rights reserved.
引用
收藏
页码:71 / 80
页数:9
相关论文
共 50 条
  • [41] Deep Anomaly Detection with Ensemble-Based Active Learning
    Tang, Xuning
    Astle, Yihua Shi
    Freeman, Craig
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1663 - 1670
  • [42] Subfeature Ensemble-Based Hyperspectral Anomaly Detection Algorithm
    Wang, Shuo
    Feng, Wei
    Quan, Yinghui
    Bao, Wenxing
    Dauphin, Gabriel
    Gao, Lianru
    Zhong, Xian
    Xing, Mengdao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 5943 - 5952
  • [43] Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks
    Aliaksandr Barushka
    Petr Hajek
    Neural Computing and Applications, 2020, 32 : 4239 - 4257
  • [44] An ensemble-based approach for image classification using voting classifier
    Bhati, Bhoopesh Singh
    Shankar, Achyut
    Saxena, Srishti
    Saxena, Tripti
    Anbarasi, M.
    Kumar, Manoj
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2022, 41 (1-2) : 87 - 97
  • [45] An Efficient, Ensemble-Based Classification Framework for Big Medical Data
    Khan, Firoz
    Prasad, Balusupati Veera Venkata Siva
    Syed, Salman Ali
    Ashraf, Imran
    Ramasamy, Lakshmana Kumar
    BIG DATA, 2022, 10 (02) : 151 - 160
  • [46] Ensemble-Based Knowledge Distillation for Video Anomaly Detection
    Asal, Burcak
    Can, Ahmet Burak
    APPLIED SCIENCES-BASEL, 2024, 14 (03):
  • [47] Detection of financial information manipulation by an ensemble-based mechanism
    Shih, Ching-Hui
    Lin, Sin-Jin
    Hsu, Ming-Fu
    Neural Network World, 2014, 5 (14) : 479 - 499
  • [48] Ensemble-Based Deep Learning Model for Network Traffic Classification
    Aouedi, Ons
    Piamrat, Kandaraj
    Parrein, Benoit
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 4124 - 4135
  • [49] An ensemble-based framework for mispronunciation detection of Arabic phonemes
    Calik, Sukru Selim
    Kucukmanisa, Ayhan
    Kilimci, Zeynep Hilal
    APPLIED ACOUSTICS, 2023, 212
  • [50] Novel loss functions for ensemble-based medical image classification
    Rajaraman, Sivaramakrishnan
    Zamzmi, Ghada
    Antani, Sameer K.
    PLOS ONE, 2021, 16 (12):