Comparative Study on Email Spam Classifier using Data Mining Techniques

被引:0
|
作者
Kumar, R. Kishore [1 ]
Poonkuzhali, G. [2 ]
Sudhakar, P. [3 ]
机构
[1] Sri Sivasubramaniya Nadar Coll Engn, Dept Comp Sci & Engn, Old Mahabalipuram Rd, Ssn Nagar 603110, Tamil Nadu, India
[2] Anna Univ, Rajalakshmi Engn Coll, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
[3] Kamaraj Coll Engn, Dept Comp Sci & Engn, Virudunagar, Tamil Nadu, India
关键词
classifier; e-mail; feature construction; feature selection; relevance analysis; spam;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this e-world, most of the transactions and business is taking place through e-mails. Nowadays, email becomes a powerful tool for communication as it saves a lot of time and cost. But, due to social networks and advertisers, most of the emails contain unwanted information called spam. Even though lot of algorithms has been developed for email spam classification, still none of the algorithms produces 100% accuracy in classifying spam emails. In this paper, spam dataset is analyzed using TANAGRA data mining tool to explore the efficient classifier for email spam classification. Initially, feature construction and feature selection is done to extract the relevant features. Then various classification algorithms are applied over this dataset and cross validation is done for each of these classifiers. Finally, best classifier for email spam is identified based on the error rate, precision and recall.
引用
收藏
页码:539 / 544
页数:6
相关论文
共 50 条
  • [1] Content Based Spam Detection in Email using Bayesian Classifier
    Rathod, Sunil B.
    Pattewar, Tareek M.
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2015, : 1257 - 1261
  • [2] VGI and crowdsourced data credibility analysis using spam email detection techniques
    Koswatte, Saman
    McDougall, Kevin
    Liu, Xiaoye
    [J]. INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2018, 11 (05) : 520 - 532
  • [3] Spam Email Detection Using Deep Learning Techniques
    AbdulNabi, Isra'a
    Yaseen, Qussai
    [J]. 12TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 4TH INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2021, 184 : 853 - 858
  • [4] Detecting Arabic YouTube Spam Using Data Mining Techniques
    Tashtoush, Yahya
    Magableh, Areen
    Darwish, Omar
    Smadi, Lujain
    Alomari, Omar
    ALghazoo, Anood
    [J]. 2022 10TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSICS AND SECURITY (ISDFS), 2022,
  • [5] An Intelligent Spam Email Filtering Approach Using a Learning Classifier System
    Al-Ajeli, Ahmed
    Al-Shamery, Eman S.
    Alubady, Raaid
    [J]. INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2022, 22 (03) : 233 - 244
  • [6] Email Phising Detection and Prevention by Using Data Mining Techniques
    Senturk, Serafettin
    Yerli, Elif
    Sogukpinar, Ibrahim
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 707 - 712
  • [7] A Proposed Data Science Approach for Email Spam Classification using Machine Learning Techniques
    Alurkar, Aakash Atul
    Ranade, Sourabh Bharat
    Joshi, Shreeya Vijay
    Ranade, Siddhesh Sanjay
    Sonewar, Piyush A.
    Mahalle, Parikshit N.
    Deshpande, Arvind V.
    [J]. 2017 JOINT 13TH CTTE AND 10TH CMI CONFERENCE ON INTERNET OF THINGS - BUSINESS MODELS, USERS, AND NETWORKS, 2017,
  • [8] A Comparative Study of Estimation by Analogy using Data Mining Techniques
    Nagpal, Geeta
    Uddin, Moin
    Kaur, Arvinder
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2012, 8 (04): : 621 - 652
  • [9] Classification of Textual E-Mail Spam Using Data Mining Techniques
    Alguliev, Rasim M.
    Aliguliyev, Ramiz M.
    Nazirova, Saadat A.
    [J]. APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2011, 2011
  • [10] Predictive analytics for spam email classification using machine learning techniques
    Kumar, Pradeep
    [J]. International Journal of Computer Applications in Technology, 2020, 64 (03): : 282 - 296