Content-based SMS Spam Filtering based on the Scaled Conjugate Gradient Backpropagation Algorithm

被引:0
|
作者
Waheeb, Waddah [1 ,2 ]
Ghazali, Rozaida [1 ]
Deris, Mustafa Mat [1 ]
机构
[1] Univ Tun Hussein Onn Malaysia, Fac Comp Sci & Informat Technol, Batu Pahat Johor 86400, Parit Raja, Malaysia
[2] Hodeidah Univ, Dept Comp Sci, Hodeidah, Yemen
关键词
Content-based SMS spam filtering; artificial neural network; conjugate gradient algorithm; short text classification; feature selection; imbalanced data set; FEATURE-SELECTION; CLASSIFICATION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Content-based filtering is one of the most preferred methods to combat Short Message Service (SMS) spam. Memory usage and classification time are essential in SMS spam filtering, especially when working with limited resources. Therefore, suitable feature selection metric and proper filtering technique should be used. In this paper, we investigate how a learnt Artificial Neural Network with the Scaled Conjugate Gradient method (ANN-SCG) is suitable for content-based SMS spam filtering using a small size of features selected by Gini Index (GI) metric. The performance of ANN-SCG is evaluated in terms of true positive rate against false positive rate, Matthews Correlation Coefficient (MCC) and classification time. The evaluation results show the ability of ANN-SCG to filter SMS spam successfully with only one hundred features and a short classification time around to six microseconds. Thus, memory size and filtering time are reduced. An additional testing using unseen SMS messages is done to validate ANN-SCG with the one hundred features. The result again proves the efficiency of ANN-SCG with the one hundred features for SMS spam filtering with accuracy equal to 99.1%.
引用
收藏
页码:675 / 680
页数:6
相关论文
共 50 条
  • [1] Content-based Approach for Vietnamese Spam SMS Filtering
    Pham, Thai-Hoang
    Le-Hong, Phuong
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 41 - 44
  • [2] Content-Based Spam Filtering
    Almeida, Tiago A.
    Yamakami, Akebo
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [3] An Overview of Content-Based Spam Filtering Techniques
    Khorsi, Ahmed
    [J]. INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2007, 31 (03): : 269 - 277
  • [4] A Method of SMS Spam Filtering Based on AdaBoost Algorithm
    Zhang, Xipeng
    Xiong, Gang
    Hu, Yuexiang
    Zhu, Fenghua
    Dong, Xisong
    Nyberg, Timo R.
    [J]. PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 2328 - 2332
  • [5] A Scaled Conjugate Gradient Backpropagation Algorithm for Keyword Extraction
    Aich, Ankit
    Dutta, Amit
    Chakraborty, Aruna
    [J]. INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, INDIA 2017, 2018, 672 : 674 - 684
  • [6] Content-based concept drift detection for Email spam filtering
    Zi Hayat, Morteza
    Basiri, Javad
    Seyedhossein, Leila
    Shakery, Azadeh
    [J]. 2010 5th International Symposium on Telecommunications, IST 2010, 2010, : 531 - 536
  • [7] Improving Static SMS Spam Detection by Using New Content-based Features
    Karami, Amir
    Zhou, Lina
    [J]. AMCIS 2014 PROCEEDINGS, 2014,
  • [8] A Fast Content-based Spam Filtering Algorithm with Fuzzy-SVM and K-means
    Wang, Shengnan
    Zhang, Xiaoyong
    Cheng, Yijun
    Jiang, Fu
    Yu, Wentao
    Peng, Jun
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 301 - 307
  • [9] Scaled Conjugate Gradient Backpropagation based SLA Violation Prediction in Cloud Computing
    Upadhyay, Prabhat Kumar
    Pandita, Archana
    Joshi, Nisheeth
    [J]. PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, : 203 - 208
  • [10] SMS Spam Filtering Based on "Cloud Security"
    Wu, Hongli
    Jiang, Yonghui
    [J]. INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 2015 - 2019