Automatic Dataset Labelling and Feature Selection for Intrusion Detection Systems

被引:17
|
作者
Aparicio-Navarro, Francisco J. [1 ]
Kyriakopoulos, Konstantinos G. [1 ]
Parish, David J. [1 ]
机构
[1] Univ Loughborough, Sch Elect Elect & Syst Engn, Loughborough LE11 3TU, Leics, England
基金
英国工程与自然科学研究理事会;
关键词
Automatic Labelling; Network Traffic Labelling; Unsupervised Anomaly IDS; Feature Selection; Genetic Algorithm;
D O I
10.1109/MILCOM.2014.17
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Correctly labelled datasets are commonly required. Three particular scenarios are highlighted, which showcase this need. When using supervised Intrusion Detection Systems (IDSs), these systems need labelled datasets to be trained. Also, the real nature of the analysed datasets must be known when evaluating the efficiency of the IDSs when detecting intrusions. Another scenario is the use of feature selection that works only if the processed datasets are labelled. In normal conditions, collecting labelled datasets from real networks is impossible. Currently, datasets are mainly labelled by implementing off-line forensic analysis, which is impractical because it does not allow real-time implementation. We have developed a novel approach to automatically generate labelled network traffic datasets using an unsupervised anomaly based IDS. The resulting labelled datasets are subsets of the original unlabelled datasets. The labelled dataset is then processed using a Genetic Algorithm (GA) based approach, which performs the task of feature selection. The GA has been implemented to automatically provide the set of metrics that generate the most appropriate intrusion detection results.
引用
下载
收藏
页码:46 / 51
页数:6
相关论文
共 50 条
  • [41] Feature Detection with Automatic Scale Selection
    Tony Lindeberg
    International Journal of Computer Vision, 1998, 30 : 79 - 116
  • [42] Automatic recommendation of feature selection algorithms based on dataset characteristics
    Sabino Parmezan, Antonio Rafael
    Lee, Huei Diana
    Spolaor, Newton
    Wu, Feng Chung
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 185
  • [43] Empirical Study of Automatic Dataset Labelling
    Aparicio-Navarro, Francisco J.
    Kyriakopoulos, Konstantinos G.
    Parish, David J.
    2014 9TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2014, : 372 - 378
  • [44] A selection criteria for intrusion detection systems
    Amoroso, E
    Kwapniewski, R
    14TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, PROCEEDINGS, 1998, : 280 - 288
  • [45] An effective genetic algorithm-based feature selection method for intrusion detection systems
    Halim, Zahid
    Yousaf, Muhammad Nadeem
    Waqas, Muhammad
    Sulaiman, Muhammad
    Abbas, Ghulam
    Hussain, Masroor
    Ahmad, Iftekhar
    Hanif, Muhammad
    COMPUTERS & SECURITY, 2021, 110
  • [46] Feature selection using rough set in intrusion detection
    Zainal, Anazida
    Maarof, Mohd Aizaini
    Shamsuddin, Siti Mariyam
    TENCON 2006 - 2006 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2006, : 2026 - +
  • [47] Application of feature selection and fuzzy ARTMAP to intrusion detection
    Vilakazi, Christina B.
    Marwala, Tshilidzi
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 4880 - +
  • [48] A Cascaded Feature Selection Approach in Network Intrusion Detection
    Sun, Yong
    Liu, Feng
    2015 WORLD CONGRESS ON INTERNET SECURITY (WORLDCIS), 2015, : 119 - 124
  • [49] A feature selection algorithm towards efficient intrusion detection
    Yin, Chunyong
    Ma, Luyu
    Feng, Lu
    Yin, Zhichao
    Wang, Jin
    International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (11): : 253 - 264
  • [50] A Fusion of Feature Extraction and Feature Selection Technique for Network Intrusion Detection
    Hamid, Yasir
    Sugumaran, M.
    Journaux, Ludovic
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2016, 10 (08): : 151 - 158