An approach for spam E-mail detection with support vector machine and n-gram indexing

被引:0
|
作者
Moon, J [1 ]
Shon, T
Seo, J
Kim, J
Seo, J
机构
[1] Korea Univ, Ctr Informat Secur Technol, Seoul 136701, South Korea
[2] ETRI, Natl Secur Res Inst, Taejon, South Korea
[3] Samsung Elect Co, Suwon, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many solutions have been deployed to prevent harmful effects from spam mail. Typical methods are either pattern matching using the keyword or method using the probability such as naive Bayesian method. In this paper, we proposed a classification method of spam mail from normal mail using support vector machine, which has excellent performance in binary pattern classification problems. Especially, the proposed method efficiently practices a learning procedure with a word dictionary by the n-gram. In the conclusion, we showed our proposed method being superior to others in the aspect of comparing performance.
引用
收藏
页码:351 / 362
页数:12
相关论文
共 50 条
  • [1] Spam E-Mail Classification by Utilizing N-Gram Features of Hyperlink Texts
    Bozkir, A. Selman
    Sahin, Esra
    Aydos, Murat
    Sezer, Ebru Akcapinar
    Orhan, Fatih
    [J]. 2017 11TH IEEE INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT 2017), 2017, : 308 - 312
  • [2] Time-efficient spam e-mail filtering using n-gram models
    Ciltik, Ali
    Gungor, Tunga
    [J]. PATTERN RECOGNITION LETTERS, 2008, 29 (01) : 19 - 33
  • [3] Attention Mechanism and Support Vector Machine for Image-Based E-Mail Spam Filtering
    Hnini, Ghizlane
    Riffi, Jamal
    Mahraz, Mohamed Adnane
    Yahyaouy, Ali
    Tairi, Hamid
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (02)
  • [4] E-mail spam classification using S-cuckoo search and support vector machine
    Kumaresan, T.
    Palanisamy, C.
    [J]. INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2017, 9 (03) : 142 - 156
  • [5] Machine Learning Based Spam E-mail Detection System for Turkish
    Eryilmaz, Ersin Enes
    Sahin, Durmus Ozkan
    Kilic, Erdal
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 7 - 12
  • [6] On feature extraction for spam e-mail detection
    Gunal, Serkan
    Ergin, Semih
    Gulmezoglu, M. Bilginer
    Gerek, O. Nezih
    [J]. MULTIMEDIA CONTENT REPRESENTATION, CLASSIFICATION AND SECURITY, 2006, 4105 : 635 - 642
  • [7] SpamED: A Spam E-Mail Detection Approach Based on Phrase Similarity
    Pera, Maria Soledad
    Ng, Yiu-Kai
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (02): : 393 - 409
  • [8] Effective computational detection of piRNAs using n-gram models and support vector machine
    Chen, Chun-Chi
    Qian, Xiaoning
    Yoon, Byung-Jun
    [J]. BMC BIOINFORMATICS, 2017, 18
  • [9] Effective computational detection of piRNAs using n-gram models and support vector machine
    Chun-Chi Chen
    Xiaoning Qian
    Byung-Jun Yoon
    [J]. BMC Bioinformatics, 18
  • [10] Research on the Application of E-Mail Classification Based on Support Vector Machine
    Shi, Tiefeng
    [J]. FRONTIERS IN COMPUTER EDUCATION, 2012, 133 : 987 - 994