Incremental Information Gain Analysis of Input Attribute Impact on RBF-Kernel SVM Spam Detection

Cited by: 0
Authors
He, Hongmei [1]
Tiwari, Ashutosh [1]
Mehnen, Joern [1]
Watson, Tim [2]
Maple, Carsten [2]
Jin, Yaochu [3]
Gabrys, Bogdan [4]
Affiliations
[1] Cranfield Univ, Sch Aerosp Transport & Mfg, Cranfield MK43 0AL, Beds, England
[2] Univ Warwick, WMG, Cyber Secur Ctr, Coventry CV4 7AL, W Midlands, England
[3] Univ Surrey, Dept Comp Sci, Guildford GU2 5XH, Surrey, England
[4] Bournemouth Univ, Sch Design Engn & Comp, Poole BH12 5BB, Dorset, England
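Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract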
The massive increase in spam poses a very serious threat to email and SMS, which have become important means of communication. Spam not only annoys users but also poses a security threat. Machine learning techniques have been widely used for spam detection. Email spam can be detected through the sender's behaviour, the content of an email, its subject and source address, etc., while SMS spam detection is usually based on the tokens or features of messages because their content is short. However, a comprehensive analysis of email/SMS content may provide cues that help users become aware of email/SMS spam. We cannot depend completely on automatic tools to identify all spam. In this paper, we propose an analysis approach based on information entropy and incremental learning to see how various features affect the performance of an RBF-based SVM spam detector, so as to increase our awareness of spam by sensing its features. The experiments were carried out on the spambase and SMSSpamCollection databases in the UCI machine learning repository. The results show that some features have significant impacts on spam detection, of which users should be aware, and that there exists a feature space that achieves Pareto efficiency in True Positive Rate and True Negative Rate.
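As a rough illustration of the entropy-based part of the analysis described in the abstract, the following minimal Python sketch ranks discrete features by information gain and computes the True Positive Rate and True Negative Rate for a set of predictions. It is not the authors' implementation: the function names (`entropy`, `information_gain`, `rank_features`, `tpr_tnr`) and the toy data are hypothetical, features are assumed to be already discretised, and the RBF-kernel SVM training step is omitted.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """Entropy reduction in `labels` after splitting on a discrete feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_features(rows, labels):
    """Return feature indices sorted by decreasing information gain."""
    n_features = len(rows[0])
    gains = [information_gain([r[j] for r in rows], labels)
             for j in range(n_features)]
    return sorted(range(n_features), key=lambda j: gains[j], reverse=True)


def tpr_tnr(y_true, y_pred, positive=1):
    """True positive rate and true negative rate for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    tpr = tp / (tp + fn) if tp + fn else 0.0
    tnr = tn / (tn + fp) if tn + fp else 0.0
    return tpr, tnr


# Hypothetical toy data: each row holds three binary token indicators,
# and label 1 marks a spam message.
rows = [(1, 0, 1), (1, 1, 0), (0, 0, 1), (0, 1, 0)]
labels = [1, 1, 0, 0]
order = rank_features(rows, labels)
```

In an incremental study of this kind, one would presumably train a classifier on the growing feature subsets `order[:1]`, `order[:2]`, and so on, recording `tpr_tnr` at each step to see which added feature shifts the two rates.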
Pages: 1022-1029
Number of pages: 8