Email Spam Classification by Support Vector Machine

被引:0
|
作者
Singh, Manmohan [1 ]
Pamula, Rajendra [1 ]
Shekhar, Shudhanshu Kumar [1 ]
机构
[1] Indian Sch Mines, Iindian Inst Technol, Dept Comp Sci & Engn, Dhanbad, Bihar, India
来源
2018 INTERNATIONAL CONFERENCE ON COMPUTING, POWER AND COMMUNICATION TECHNOLOGIES (GUCON) | 2018年
关键词
Machine Learning; SVM; Linear Kernel; Gaussian Kernel; SpamAssasin Public Corpus Dataset;
D O I
暂无
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Traditionally spam filtering techniques such as Black and White List were employed but with todays state of the Internet these methods are becoming Obsolete. With increasing popularity of the internet it is difficult to prepare a spam filter to effectively separate the spam mails from useful mails automatically before even they enter the inbox and thus crowding up the space in the inbox. Many computer scientists have been working on the methods to develop a machine learning based algorithm using statistical learning methods to tackle this problem. What is considered as a major concern right now is to make a spam filter that can efficiently capture all the spam messages and all the variety they come in and at the same time perform at a high rate. Within the context of Machine learning SVM can play a major role in spam detections and filtering however SVM faces one problem which is the choice of the kernel for the SVM that direly affects its performance. In this paper, we evaluate the performance of Non Linear SVM based classifiers with two different kernel functions i.e. Linear Kernel and Gaussian Kernel over SpamAssasin Public Corpus Dataset. Furthermore we compare the Training and Testing accuracy of these 2 kernels on the above mentioned dataset and attempt to explain which Kernel Behaves better with which dataset. Then we take some Emails extracted from Gmails Inbox and spam container and test our classifier on them.
引用
收藏
页码:878 / 882
页数:5
相关论文
共 50 条
  • [11] Email Spam Classification and Detection using Various Machine Learning Classifiers
    Saraswathi, N.
    Pradeep, S.
    Sathiyavathi, V.
    Sabitha, K.
    Kambattan, K. Rajesh
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [12] Predictive analytics for spam email classification using machine learning techniques
    Kumar, Pradeep
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2020, 64 (03) : 282 - 296
  • [13] Extreme Learning Machines and Support Vector Machines Models for Email spam detection
    Olatunji, Sunday Olusanya
    2017 IEEE 30TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2017,
  • [14] Support Vector Machine Based Spam SMS Detection
    Tekerek, Adem
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2019, 22 (03): : 779 - 784
  • [15] Email Sentiment Analysis Through k-Means Labeling and Support Vector Machine Classification
    Liu, Sisi
    Lee, Ickjai
    CYBERNETICS AND SYSTEMS, 2018, 49 (03) : 181 - 199
  • [16] Data Classification with Support Vector Machine and Generalized Support Vector Machine
    Qi, Xiaomin
    Silvestrov, Sergei
    Nazir, Talat
    ICNPAA 2016 WORLD CONGRESS: 11TH INTERNATIONAL CONFERENCE ON MATHEMATICAL PROBLEMS IN ENGINEERING, AEROSPACE AND SCIENCES, 2017, 1798
  • [17] An Adaptive Neural Network for Email Spam Classification
    Kumar, Jitendra
    Santhanavijayan, A.
    Rajendran, Balaji
    Bindhumadhava, B. S.
    2019 FIFTEENTH INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (ICINPRO): INTERNET OF THINGS, 2019, : 54 - 60
  • [18] An SMS Spam Filtering System Using Support Vector Machine
    Joe, Inwhee
    Shim, Hyetaek
    FUTURE GENERATION INFORMATION TECHNOLOGY, 2010, 6485 : 577 - 584
  • [19] Research on spam filtering technology using Support Vector Machine
    Mei, Zheng
    Ji, Geng
    Xiao, Li
    Qiao, Liu
    2007 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1 AND 2: VOL 1: COMMUNICATION THEORY AND SYSTEMS; VOL 2: SIGNAL PROCESSING, COMPUTATIONAL INTELLIGENCE, CIRCUITS AND SYSTEMS, 2007, : 492 - +
  • [20] A Proposed Data Science Approach for Email Spam Classification using Machine Learning Techniques
    Alurkar, Aakash Atul
    Ranade, Sourabh Bharat
    Joshi, Shreeya Vijay
    Ranade, Siddhesh Sanjay
    Sonewar, Piyush A.
    Mahalle, Parikshit N.
    Deshpande, Arvind V.
    2017 JOINT 13TH CTTE AND 10TH CMI CONFERENCE ON INTERNET OF THINGS - BUSINESS MODELS, USERS, AND NETWORKS, 2017,