Relevance Feature Selection with Data Cleaning for Intrusion Detection System

被引:0
|
作者
Suthaharan, Shan [1 ]
Panchagnula, Tejaswi [1 ]
机构
[1] Univ N Carolina, Dept Comp Sci, Greensboro, NC 27412 USA
关键词
intrusion detection; Rough Set Theory; labeled datasets; NSL-KDD dataset; relevance feature selection;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Labeled datasets play a major role in the process of validating and evaluating machine learning techniques in intrusion detection systems. In order to obtain good accuracy in the evaluation, very large datasets should be considered. Intrusion traffic and normal traffic are in general dependent on a large number of network characteristics called features. However not all of these features contribute to the traffic characteristics. Therefore, eliminating the non-contributing features from the datasets, to facilitate speed and accuracy to the evaluation of machine learning techniques, becomes an important requirement. In this paper we suggest an approach which analyzes the intrusion datasets, evaluates the features for its relevance to a specific attack, determines the level of contribution of feature, and eliminates it from the dataset automatically. We adopt the Rough Set Theory (RST) based approach and select relevance features using multidimensional scatter-plot automatically. A pair-wise feature selection process is adopted to simplify. In our previous research we used KDD'99 dataset and validated the RST based approach. There are lots of redundant data entries in KDD'99 and thus the machine learning techniques are biased towards most occurring events. This property leads the algorithms to ignore less frequent events which can be more harmful than most occurring events. False positives are another important drawback in KDD'99 dataset. In this paper, we adopt NSL-KDD dataset (an improved version of KDD'99 dataset) and validate the automated RST based approach. The approach presented in this paper leads to a selection of most relevance features and we expect that the intrusion detection research using KDD'99-based datasets will benefit from the good understanding of network features and their influences to attacks.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Genetic Feature Selection in Intrusion Detection System
    Han, Myung-Mook
    Kim, Jaehyoun
    Jeong, Taikyeong
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2011, 14 (02): : 493 - 502
  • [2] Sequential Pattern Mining for Intrusion Detection System with Feature Selection on Big Data
    Fidalcastro, A.
    Baburaj, E.
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (10): : 5003 - 5018
  • [3] Feature Selection Algorithms in Intrusion Detection System: A Survey
    Maza, Sofiane
    Touahria, Mohamed
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (10): : 5079 - 5099
  • [4] Unsupervised Feature Selection Method for Intrusion Detection System
    Ambusaidi, Mohammed A.
    He, Xiangjian
    Nanda, Priyadarsi
    [J]. 2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 1, 2015, : 295 - 301
  • [5] A Feature Selection Based DNN for Intrusion Detection System
    Li, Li-Hua
    Ahmad, Ramli
    Tsai, Wen-Chung
    Sharma, Alok Kumar
    [J]. PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,
  • [6] An Intrusion Detection System Using Unsupervised Feature Selection
    Suman, Chanchal
    Tripathy, Somanath
    Saha, Sriparna
    [J]. PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 19 - 24
  • [7] A novel feature selection approach for intrusion detection data classification
    Ambusaidi, Mohammed A.
    He, Xiangjian
    Tan, Zhiyuan
    Nanda, Priyadarsi
    Lu, Liang Fu
    Nagar, Upasana T.
    [J]. 2014 IEEE 13TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM), 2014, : 82 - 89
  • [8] Combinational Feature Selection Approach for Network Intrusion Detection System
    Garg, Tanya
    Kumar, Yogesh
    [J]. 2014 INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2014, : 82 - 87
  • [9] An Ensemble Intrusion Detection System based on Acute Feature Selection
    Hariprasad, S.
    Deepa, T.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 8267 - 8280
  • [10] Differential Evolution Wrapper Feature Selection for Intrusion Detection System
    Almasoudy, Faezah Hamad
    Al-Yaseen, Wathiq Laftah
    Idrees, Ali Kadhum
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1230 - 1239