CBFS: A Clustering-Based Feature Selection Mechanism for Network Anomaly Detection

被引:6
|
作者
Mao, Jiewen [1 ]
Hu, Yongquan [1 ]
Jiang, Dong [1 ]
Wei, Tongquan [1 ]
Shen, Fuke [1 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China
关键词
Feature selection; clustering; information gain; classification; decision tree; intrusion detection; INTRUSION DETECTION SYSTEM; ALGORITHM; HYBRID;
D O I
10.1109/ACCESS.2020.3004699
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Network traffic flows contain a large number of correlated and redundant features that significantly degrade the performance of data-driven network anomaly detection. In this paper, we propose a novel clustering and ranking-based feature selection scheme, termed as CBFS, to reduce redundant features in network traffic, which can greatly improve the efficiency and accuracy of feature-based network anomaly detection. Our proposed CBFS scheme first calculates the distance between feature vectors, merges these feature vectors into different clusters, and selects the center of each cluster as a representative feature vector. The proposed CBFS scheme then integrates the information gain and gain rate of features to further streamline the number of features on the basis of clustering generation. Finally, the proposed CBFS scheme applies the decision-tree-based classifier to the generated subset of features so that the abnormal traffic flows are detected. The experimental results show that our proposed CBFS scheme is effective in reducing feature dimensions across different datasets. The proposed CBFS scheme can achieve feature reduction rates of 20% to 70%, and cost-performance of up to 70% as compared to benchmarking methods.
引用
收藏
页码:116216 / 116225
页数:10
相关论文
共 50 条
  • [1] Clustering-based label estimation for network anomaly detection
    Baek, Sunhee
    Kwon, Donghwoon
    Suh, Sang C.
    Kim, Hyunjoo
    Kim, Ikkyun
    Kim, Jinoh
    [J]. DIGITAL COMMUNICATIONS AND NETWORKS, 2021, 7 (01) : 37 - 44
  • [2] A clustering-based feature selection via feature separability
    Jiang, Shengyi
    Wang, Lianxi
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 31 (02) : 927 - 937
  • [3] Network traffic analysis over clustering-based collective anomaly detection
    Wang, Chonghua
    Zhou, Hao
    Hao, Zhiqiang
    Hu, Shu
    Li, Jun
    Zhang, Xueying
    Jiang, Bo
    Chen, Xuehong
    [J]. COMPUTER NETWORKS, 2022, 205
  • [4] Clustering-based Anomaly Detection for Smartphone Applications
    El Attar, Ali
    Khatoun, Rida
    Lemercier, Marc
    [J]. 2014 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (NOMS), 2014,
  • [5] Network Anomaly Detection Using Unsupervised Feature Selection and Density Peak Clustering
    Ni, Xiejun
    He, Daojing
    Chan, Sammy
    Ahmad, Farooq
    [J]. APPLIED CRYPTOGRAPHY AND NETWORK SECURITY, ACNS 2016, 2016, 9696 : 212 - 227
  • [6] Entropy-Based Feature Selection for Network Anomaly Detection
    Alabi, Ruth
    Yurtkan, Kamil
    [J]. 2018 2ND INTERNATIONAL SYMPOSIUM ON MULTIDISCIPLINARY STUDIES AND INNOVATIVE TECHNOLOGIES (ISMSIT), 2018, : 563 - 569
  • [7] CLUSTERING-BASED NETWORK INTRUSION DETECTION
    Zhong, Shi
    Khoshgoftaar, Taghi M.
    Seliya, Naeem
    [J]. INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING, 2007, 14 (02) : 169 - 187
  • [8] Clustering-based feature selection for verb sense disambiguation
    Chen, JY
    Palmer, M
    [J]. Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 36 - 41
  • [9] Clustering-based Feature Selection for Internet Attack Defense
    Seo, Jungtaek
    Kim, Jungtae
    Moon, Jongsub
    Kang, Boo Jung
    Im, Eul Gyu
    [J]. INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2008, 1 (01): : 91 - 98
  • [10] A Hybrid Unsupervised Clustering-Based Anomaly Detection Method
    Guo Pu
    Lijuan Wang
    Jun Shen
    Fang Dong
    [J]. Tsinghua Science and Technology, 2021, 26 (02) : 146 - 153