Spam detection on social networks using cost-sensitive feature selection and ensemble-based regularized deep neural networks

被引:34
|
作者
Barushka, Aliaksandr [1 ]
Hajek, Petr [1 ]
机构
[1] Univ Pardubice, Inst Syst Engn & Informat, Fac Econ & Adm, Studentska 84, Pardubice 53210, Czech Republic
来源
NEURAL COMPUTING & APPLICATIONS | 2020年 / 32卷 / 09期
关键词
Neural network; Social networks; Regularization; Ensemble learning; Misclassification cost; DETECTION SYSTEM; ACCOUNTS;
D O I
10.1007/s00521-019-04331-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spam detection on social networks is increasingly important owing to the rapid growth of social network user base. Sophisticated spam filters must be developed to deal with this complex problem. Traditional machine learning approaches such as neural networks, support vector machines and Naive Bayes classifiers are not effective enough to process and utilize complex features present in high-dimensional data on social network spam. Moreover, the traditional objective criteria of social network spam filters cannot cope with different costs assigned to type I and type II errors. To overcome these problems, here we propose a novel cost-sensitive approach to social network spam filtering. The proposed approach is composed of two stages. In the first stage, multi-objective evolutionary feature selection is used to minimize both the misclassification cost of the proposed model and the number of attributes necessary for spam filtering. Then, the approach uses cost-sensitive ensemble learning techniques with regularized deep neural networks as base learners. We demonstrate that this approach is effective for social network spam filtering on two benchmark datasets. We also show that the proposed approach outperforms other popular algorithms used in social network spam filtering, such as random forest, Naive Bayes or support vector machines.
引用
收藏
页码:4239 / 4257
页数:19
相关论文
共 50 条
  • [21] Ensemble-based community detection in multilayer networks
    Tagarelli, Andrea
    Amelio, Alessia
    Gullo, Francesco
    DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 31 (05) : 1506 - 1543
  • [22] Adaptive cost-sensitive stance classification model for rumor detection in social networks
    Zojaji, Zahra
    Tork Ladani, Behrouz
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
  • [23] Spam detection on social networks using deep contextualized word representation
    Razan Ghanem
    Hasan Erbay
    Multimedia Tools and Applications, 2023, 82 : 3697 - 3712
  • [24] Spam detection on social networks using deep contextualized word representation
    Ghanem, Razan
    Erbay, Hasan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3697 - 3712
  • [25] Cost-sensitive feature selection based on Adaptive Hunting Optimization
    Liang, Yixuan
    2024 4TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE, CCAI 2024, 2024, : 546 - 551
  • [26] Cost-sensitive Hybrid Neural Networks for Heterogeneous and Imbalanced Data
    Jiang, Xinxin
    Pan, Shirui
    Long, Guodong
    Chang, Jiang
    Jiang, Jing
    Zhang, Chengqi
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [27] Cost-sensitive boosting neural networks for software defect prediction
    Zheng, Jun
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (06) : 4537 - 4543
  • [28] Spam detection in online social networks by deep learning
    Ameen, Aso Khaleel
    Kaya, Buket
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [29] FS2RNN: Feature Selection Scheme for Web Spam Detection using Recurrent Neural Networks
    Makkar, Aaisha
    Obaidat, Mohammad S.
    Kumar, Neeraj
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [30] Cost-Sensitive Neural Networks and Editing Techniques for Imbalance Problems
    Alejo, R.
    Sotoca, J. M.
    Garcia, V.
    Valdovinos, R. M.
    ADVANCES IN PATTERN RECOGNITION, 2010, 6256 : 180 - +