Email Spam Classification by Support Vector Machine

被引:0
|
作者
Singh, Manmohan [1 ]
Pamula, Rajendra [1 ]
Shekhar, Shudhanshu Kumar [1 ]
机构
[1] Indian Sch Mines, Iindian Inst Technol, Dept Comp Sci & Engn, Dhanbad, Bihar, India
关键词
Machine Learning; SVM; Linear Kernel; Gaussian Kernel; SpamAssasin Public Corpus Dataset;
D O I
暂无
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Traditionally spam filtering techniques such as Black and White List were employed but with todays state of the Internet these methods are becoming Obsolete. With increasing popularity of the internet it is difficult to prepare a spam filter to effectively separate the spam mails from useful mails automatically before even they enter the inbox and thus crowding up the space in the inbox. Many computer scientists have been working on the methods to develop a machine learning based algorithm using statistical learning methods to tackle this problem. What is considered as a major concern right now is to make a spam filter that can efficiently capture all the spam messages and all the variety they come in and at the same time perform at a high rate. Within the context of Machine learning SVM can play a major role in spam detections and filtering however SVM faces one problem which is the choice of the kernel for the SVM that direly affects its performance. In this paper, we evaluate the performance of Non Linear SVM based classifiers with two different kernel functions i.e. Linear Kernel and Gaussian Kernel over SpamAssasin Public Corpus Dataset. Furthermore we compare the Training and Testing accuracy of these 2 kernels on the above mentioned dataset and attempt to explain which Kernel Behaves better with which dataset. Then we take some Emails extracted from Gmails Inbox and spam container and test our classifier on them.
引用
收藏
页码:878 / 882
页数:5
相关论文
共 50 条
  • [1] Spam Email Detection Using Deep Support Vector Machine, Support Vector Machine and Artificial Neural Network
    Roy, Sanjiban Sekhar
    Sinha, Abhishek
    Roy, Reetika
    Barna, Cornel
    Samui, Pijush
    SOFT COMPUTING APPLICATIONS, SOFA 2016, VOL 2, 2018, 634 : 162 - 174
  • [2] Header Based Email Spam Detection Framework Using Support Vector Machine (SVM) Technique
    Khamis, Siti Aqilah
    Foozy, Cik Feresa Mohd
    Aziz, Mohd Firdaus Ab
    Rahim, Nordiana
    RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING (SCDM 2020), 2020, 978 : 57 - 65
  • [3] Integration of support vector machine with Naive Bayesian classifier for spam classification
    Chiu, Chui-Yu
    Huang, Yuan-Ting
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2007, : 618 - 622
  • [4] An Email Classification Model Based on Rough Set and Support Vector Machine
    Zhu, Zhiqing
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 236 - 240
  • [5] Visual and textual features based email spam classification using S-Cuckoo search and hybrid kernel support vector machine
    Kumaresan, T.
    Saravanakumar, S.
    Balamurugan, R.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 1): : 33 - 46
  • [6] Visual and textual features based email spam classification using S-Cuckoo search and hybrid kernel support vector machine
    T. Kumaresan
    S. Saravanakumar
    R. Balamurugan
    Cluster Computing, 2019, 22 : 33 - 46
  • [7] Improved email spam detection model based on support vector machines
    Sunday Olusanya Olatunji
    Neural Computing and Applications, 2019, 31 : 691 - 699
  • [8] Improved email spam detection model based on support vector machines
    Olatunji, Sunday Olusanya
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (03): : 691 - 699
  • [9] Predictive analytics for spam email classification using machine learning techniques
    Kumar P.
    International Journal of Computer Applications in Technology, 2020, 64 (03): : 282 - 296
  • [10] A Comprehensive Review on Email Spam Classification using Machine Learning Algorithms
    Raza, Mansoor
    Jayasinghe, Nathali Dilshani
    Muslam, Muhana Magboul Ali
    35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 327 - 332