SPAM Detection: Naive Bayesian Classification and RPN Expression-Based LGP Approaches Compared

被引:0
|
作者
Meli, Clyde [1 ]
Oplatkova, Zuzana Kominkova [2 ]
机构
[1] Univ Malta, Fac ICT, CIS Dept, Msida, Malta
[2] Tomas Bata Univ Zlin, Fac Appl Informat, Dept Informat & Artificial Intelligence, Nam TG Masaryka 5555, Zlin, Czech Republic
关键词
Reverse polish notation (RPN); Naive bayesian classifier; Spam detection; Linear genetic programming (LGP); Genetic programming (GP);
D O I
10.1007/978-3-319-33622-0_36
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An investigation is performed of a machine learning algorithm and the Bayesian classifier in the spam-filtering context. The paper shows the advantage of the use of Reverse Polish Notation (RPN) expressions with feature extraction compared to the traditional Naive Bayesian classifier used for spam detection assuming the same features. The performance of the two is investigated using a public corpus and a recent private spam collection, concluding that the system based on RPN LGP (Linear Genetic Programming) gave better results compared to two popularly used open source Bayesian spam filters.
引用
收藏
页码:399 / 411
页数:13
相关论文
共 50 条
  • [31] A Framework for Instantaneous Driver Drowsiness Detection Based on Improved HOG Features and Naive Bayesian Classification
    Bakheet, Samy
    Al-Hamadi, Ayoub
    BRAIN SCIENCES, 2021, 11 (02) : 1 - 15
  • [32] Gene Expression-Based Classification of Paediatric Germ Cell Tumors
    Kubota, Y.
    Seki, M.
    Isobe, T.
    Yoshida, K.
    Sato, Y.
    Kataoka, K.
    Shiraishi, Y.
    Chiba, K.
    Tanaka, H.
    Hiwatari, M.
    Miyano, S.
    Hayashi, Y.
    Oka, A.
    Ogawa, S.
    Takita, J.
    PEDIATRIC BLOOD & CANCER, 2016, 63 : S26 - S26
  • [33] Feature (gene) selection in gene expression-based tumor classification
    Xiong, MM
    Li, WJ
    Zhao, JY
    Jin, L
    Boerwinkle, E
    MOLECULAR GENETICS AND METABOLISM, 2001, 73 (03) : 239 - 247
  • [34] An anti-spam filtering system based on the Naive Bayesian Classifier and Distributed Checksum Clearinghouse
    Wang, Haiyan
    Zhou, Runsheng
    Wang, Yi
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 1, PROCEEDINGS, 2009, : 128 - 131
  • [35] ACNB: Associative Classification Mining Based on Naive Bayesian Method
    Odeh, Fadi
    Al-Najdawi, Nijad
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING, 2013, 8 (01) : 23 - 35
  • [36] Fault diagnosis for fuel cell based on naive bayesian classification
    Fan, Liping
    Huang, Xing
    Yi, Liu
    Telkomnika - Indonesian Journal of Electrical Engineering, 2013, 11 (12): : 7664 - 7670
  • [37] Research on Chinese text classification based on Naive Bayesian method
    Geng Xinglong
    Gao Xiuyan
    Zhao Bin
    PROCEEDINGS OF THE FIFTH INTERNATIONAL SYMPOSIUM ON TEST AUTOMATION & INSTRUMENTATION, VOLS 1 AND 2, 2014, : 226 - 230
  • [38] Gene Expression-Based Approaches in Differentiation of Metastases and Second Primary Tumour
    Vooder, Tonu
    Valk, Kristjan
    Kolde, Raivo
    Roosipuu, Retlav
    Vilo, Jaak
    Metspalu, Andres
    CASE REPORTS IN ONCOLOGY, 2010, 3 (02): : 255 - 261
  • [39] A semantic-based classification approach for an enhanced spam detection
    Saidani, Nadjate
    Adi, Kamel
    Allili, Mohand Said
    COMPUTERS & SECURITY, 2020, 94
  • [40] Twitter spam account detection based on clustering and classification methods
    Kayode Sakariyah Adewole
    Tao Han
    Wanqing Wu
    Houbing Song
    Arun Kumar Sangaiah
    The Journal of Supercomputing, 2020, 76 : 4802 - 4837