Towards Filtering Spam Mails using Dimensionality Reduction Methods

被引:0
|
作者
Thomas, Josin [1 ]
Raj, Nisha S. [1 ]
Vinod, P. [1 ]
机构
[1] SCMS Sch Engn & Technol, Dept Comp Sci & Engn, Ernakulam, Kerala, India
关键词
feature selection; spam filtering; dimensionality reduction; classifier;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Numerous methods based on the content based filtering is available for email spam identification. Dimensionality of the feature space is recognized as one of the leading factors that affect the efficiency in classifying mails. This study identifies feature selection techniques used in the general text classification for spam filtering. Also, the classification and prediction is performed using different entities of email such as header, body and subject. We present a comparative study of different feature selection methods. Through extensive experiments we demonstrated that Weighted Mutual Information feature selection with header and body of the emails is efficient in email classification.
引用
收藏
页码:163 / 168
页数:6
相关论文
共 50 条
  • [31] Signal processing using methods of dimensionality reduction of representation space
    Popovskiy, VV
    IVTH INTERNATIONAL CONFERENCE ON ANTENNA THEORY AND TECHNIQUES, VOLS 1 AND 2, PROCEEDINGS, 2003, : 419 - 420
  • [32] Echocardiography without electrocardiogram using nonlinear dimensionality reduction methods
    Ahmad Shalbaf
    Zahra AlizadehSani
    Hamid Behnam
    Journal of Medical Ultrasonics, 2015, 42 : 137 - 149
  • [33] Filtering Spam by Using Factors Hyperbolic Trees
    Hou, Hailong
    Chen, Yan
    Beyah, Raheem
    Zhang, Yan-Qing
    GLOBECOM 2008 - 2008 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2008,
  • [34] Image spam filtering using visual information
    Biggio, Battista
    Fumera, Giorgio
    Pillai, Ignazio
    Roli, Fabio
    14TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS, 2007, : 105 - +
  • [35] Spam filtering using Kolmogorov complexity analysis
    Richard, G.
    Doncescu, A.
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2008, 4 (01) : 136 - 148
  • [36] On the Study of Anomaly-based Spam Filtering Using Spam as Representation of Normality
    Laorden, Carlos
    Ugarte-Pedrero, Xabier
    Santos, Igor
    Sanz, Borja
    Nieves, Javier
    Bringas, Pablo G.
    2012 IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE (CCNC), 2012, : 693 - 695
  • [37] Comparison of Deep and Traditional Learning Methods for Email Spam Filtering
    Sheneamer, Abdullah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (01) : 560 - 565
  • [38] An empirical study of three machine learning methods for spam filtering
    Lai, Chih-Chin
    KNOWLEDGE-BASED SYSTEMS, 2007, 20 (03) : 249 - 254
  • [39] Dimensionality Reduction for Noise Filtering of Big Data Sets
    Mahe, E.
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2020, 22 (11): : S85 - S85
  • [40] Consistent Spectral Methods for Dimensionality Reduction
    Kharouf, Malika
    Rebafka, Tabea
    Sokolovska, Nataliya
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 286 - 290