Ensemble-Based Text Classification for Spam Detection

被引:0
|
作者
Zhang X. [1 ]
Liu G. [1 ]
Zhang M. [1 ]
机构
[1] School of Information Engineering, Tangshan Polytechnic College, Hebei, Tangshan
来源
Informatica (Slovenia) | 2024年 / 48卷 / 06期
关键词
classifier selection; ensemble-based; feature extraction; spam detection; text classification;
D O I
10.31449/inf.v48i6.5246
中图分类号
学科分类号
摘要
This research proposes an ensemble-based approach for spam detection in digital communication, addressing the escalating challenge posed by unsolicited messages, commonly known as spam. The exponential growth of online platforms has necessitated the development of effective information filtering systems to maintain security and efficiency. The proposed approach involves three main components: feature extraction, classifier selection, and decision fusion. The feature extraction techniques are word embedding, are explored to represent text messages effectively. Multiple classifiers, including RNN including LSTM and GRU are evaluated to identify the best performers for spam detection. By employing the ensemble model combines the strengths of individual classifiers to achieve higher accuracy, precision, and recall. The evaluation of the proposed approach utilizes widely accepted metrics on benchmark datasets, ensuring its generalizability and robustness. The experimental results demonstrate that the ensemble-based approach outperforms individual classifiers, offering an efficient solution for combatting spam messages. Integration of this approach into existing spam filtering systems can contribute to improved online communication, user experience, and enhanced cybersecurity, effectively mitigating the impact of spam in the digital landscape. © 2024 Slovene Society Informatika. All rights reserved.
引用
收藏
页码:71 / 80
页数:9
相关论文
共 50 条
  • [1] Towards Ensemble-Based Imbalanced Text Classification Using Metric Learning
    Komamizu, Takahiro
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2023, PT II, 2023, 14147 : 188 - 202
  • [2] An ensemble-based framework for user behaviour anomaly detection and classification for cybersecurity
    Gianluigi Folino
    Carla Otranto Godano
    Francesco Sergio Pisani
    The Journal of Supercomputing, 2023, 79 : 11660 - 11683
  • [3] An ensemble-based framework for user behaviour anomaly detection and classification for cybersecurity
    Folino, Gianluigi
    Godano, Carla Otranto
    Pisani, Francesco Sergio
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (11): : 11660 - 11683
  • [4] Ensemble-based deep learning model for welding defect detection and classification
    Vasan, Vinod
    Sridharan, Naveen Venkatesh
    Balasundaram, Rebecca Jeyavadhanam
    Vaithiyanathan, Sugumaran
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [5] Ensemble-based Depression Detection in Speech
    Liu, Zhenyu
    Li, Changcong
    Gao, Xiang
    Wang, Gang
    Yang, Jing
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 975 - 980
  • [6] Ensemble-based adaptive intrusion detection
    Wei, F
    Stolfo, SJ
    PROCEEDINGS OF THE SECOND SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2002, : 41 - 58
  • [7] EBOC: Ensemble-Based Ordinal Classification in Transportation
    Yildirim, Pelin
    Birant, Ulas K.
    Birant, Derya
    JOURNAL OF ADVANCED TRANSPORTATION, 2019, 2019
  • [8] EnClass: Ensemble-based Classification Model for Network Anomaly Detection in Massive Datasets
    Garg, Sahil
    Singh, Amritpal
    Batra, Shalini
    Kumar, Neeraj
    Obaidat, M. S.
    GLOBECOM 2017 - 2017 IEEE GLOBAL COMMUNICATIONS CONFERENCE, 2017,
  • [9] Consensus based Ensemble model for Spam detection
    Pantola, Paritosh
    Bala, Anju
    Rana, Prashant Singh
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 1724 - 1727
  • [10] Towards a reliable spam detection: an ensemble classification with rejection option
    Olivo, Cleber
    Santin, Altair O.
    Viegas, Eduardo K.
    Geremias, Jhonatan
    Souto, Eduardo
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (01):