SPAM Detection: Naive Bayesian Classification and RPN Expression-Based LGP Approaches Compared

被引:0
|
作者
Meli, Clyde [1 ]
Oplatkova, Zuzana Kominkova [2 ]
机构
[1] Univ Malta, Fac ICT, CIS Dept, Msida, Malta
[2] Tomas Bata Univ Zlin, Fac Appl Informat, Dept Informat & Artificial Intelligence, Nam TG Masaryka 5555, Zlin, Czech Republic
关键词
Reverse polish notation (RPN); Naive bayesian classifier; Spam detection; Linear genetic programming (LGP); Genetic programming (GP);
D O I
10.1007/978-3-319-33622-0_36
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An investigation is performed of a machine learning algorithm and the Bayesian classifier in the spam-filtering context. The paper shows the advantage of the use of Reverse Polish Notation (RPN) expressions with feature extraction compared to the traditional Naive Bayesian classifier used for spam detection assuming the same features. The performance of the two is investigated using a public corpus and a recent private spam collection, concluding that the system based on RPN LGP (Linear Genetic Programming) gave better results compared to two popularly used open source Bayesian spam filters.
引用
收藏
页码:399 / 411
页数:13
相关论文
共 50 条
  • [21] A protein and mRNA expression-based classification of gastric cancer
    Setia, Namrata
    Agoston, Agoston T.
    Han, Hye S.
    Mullen, John T.
    Duda, Dan G.
    Clark, Jeffrey W.
    Deshpande, Vikram
    Mino-Kenudson, Mari
    Srivastava, Amitabh
    Lennerz, Jochen K.
    Hong, Theodore S.
    Kwak, Eunice L.
    Lauwers, Gregory Y.
    MODERN PATHOLOGY, 2016, 29 (07) : 772 - 784
  • [22] Comparison of a SOM based sequence analysis system and naive Bayesian classifier for spam filtering
    Luo, X
    Zincir-Heywood, N
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 2571 - 2576
  • [23] An approach to spam detection by Naive Bayes ensemble based on decision induction
    Yang, Zhen
    Nie, Xiangfei
    Xu, Weiran
    Guo, Jun
    ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, 2006, : 861 - +
  • [24] Email Spam Classification using Neighbor Probability based Naive Bayes Algorithm
    Anitha, P. U.
    Rao, C. V. Guru
    Babu, Suresh
    2017 7TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT), 2017, : 350 - 355
  • [25] Patent Text Classification Based on Naive Bayesian Method
    Xiao, Lizhong
    Wang, Guangzhong
    Liu, Yuan
    2018 11TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2018, : 57 - 60
  • [26] Content Based Spam Detection in Email using Bayesian Classifier
    Rathod, Sunil B.
    Pattewar, Tareek M.
    2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2015, : 1257 - 1261
  • [27] Automatic Classification of Document Resources Based on Naive Bayesian Classification Algorithm
    Wang, Rong
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2022, 46 (03): : 373 - 382
  • [28] Gene expression-based approaches to small molecule discovery for cancer
    Stegmaier, Kimberly
    CANCER RESEARCH, 2009, 69
  • [29] Ensemble-Based Text Classification for Spam Detection
    Zhang X.
    Liu G.
    Zhang M.
    Informatica (Slovenia), 2024, 48 (06): : 71 - 80
  • [30] Bayesian methods for expression-based integration of various types of genomics data
    Jennings, Elizabeth M.
    Morris, Jeffrey S.
    Carroll, Raymond J.
    Manyam, Ganiraju C.
    Baladandayuthapani, Veerabhadran
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2013, Springer Verlag (01):