Dealing with Imbalanced Data in Multi-class Network Intrusion Detection Systems Using XGBoost

Cited by: 1
Authors
AL-Essa, Malik [1]
Appice, Annalisa [1, 2]
Affiliations
[1] Univ Bari Aldo Moro, Dipartimento Informat, Via Orabona 4, I-70126 Bari, Italy
[2] Consorzio Interuniv Nazl Informat CINI, Bari, Italy
Keywords
Network intrusion detection; Imbalanced classification; Oversampling; Feature selection; Multi-class classification
DOI
10.1007/978-3-030-93733-1_1
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Network intrusion detection is a crucial cyber-security problem, in which machine learning is recognized as a relevant approach for detecting signs of malicious activity in network traffic. However, intrusion detection patterns learned from imbalanced network traffic data often fail to recognize rare attacks. One way to address this issue is to oversample before learning, adjusting the ratio between the classes so that the traffic data become more balanced. This paper investigates the effect of oversampling coupled with feature selection, in order to understand how feature relevance may change when artificial samples of rare classes are created. The study uses XGBoost for network traffic classification, and the experiments are performed on two benchmark multi-class network intrusion detection problems.
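The idea of adjusting the class ratio before learning can be illustrated with a minimal sketch. The abstract does not say which oversampling technique the paper uses, so the example below assumes the simplest one, random duplication of minority-class samples. The function name `random_oversample`, the toy records and the class labels are all hypothetical and are not taken from the paper.

```python
import random
from collections import Counter


def random_oversample(samples, labels, seed=0):
    """Duplicate randomly chosen minority-class samples until every class
    has as many samples as the largest class (assumed technique, not
    necessarily the one used in the paper)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())

    # Group sample indices by class label.
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)

    out_samples, out_labels = list(samples), list(labels)
    for label, indices in by_class.items():
        # Draw with replacement to fill the gap up to the majority-class size.
        for _ in range(target - len(indices)):
            pick = rng.choice(indices)
            out_samples.append(samples[pick])
            out_labels.append(label)
    return out_samples, out_labels


# Hypothetical toy traffic records: (duration, bytes) with made-up labels.
X = [(0.1, 200), (0.2, 180), (0.1, 220), (0.3, 210), (5.0, 9000), (0.01, 40)]
y = ["normal", "normal", "normal", "normal", "dos", "probe"]

X_bal, y_bal = random_oversample(X, y)
print(Counter(y_bal))
```

In a pipeline like the one the abstract describes, the balanced data would then be passed to feature selection and to the classifier. Oversampling should be applied to the training split only, so that duplicated samples do not leak into the evaluation data.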
Pages: 5-21
Number of pages: 17
Related papers
50 in total
  • [1] An FPA-Optimized XGBoost Stacking for Multi-Class Imbalanced Network Attack Detection
    Soon, Hui Fern
    Amir, Amiza
    Nishizaki, Hiromitsu
    Zahri, Nik Adilah Hanin
    Kamarudin, Latifah Munirah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 1380 - 1390
  • [2] Balanced Multi-Class Network Intrusion Detection Using Machine Learning
    Khan, Faraz Ahmad
    Shah, Asghar Ali
    Alshammry, Nizal
    Saif, Saifullah
    Khan, Wasim
    Malik, Muhammad Osama
    Ullah, Zahid
    IEEE Access, 2024, 12 : 178222 - 178236
  • [3] An Approach for the Application of a Dynamic Multi-Class Classifier for Network Intrusion Detection Systems
    Larriva-Novo, Xavier
    Sanchez-Zas, Carmen
    Villagra, Victor A.
    Vega-Barbas, Mario
    Rivera, Diego
    ELECTRONICS, 2020, 9 (11) : 1 - 18
  • [4] Multi-class Boosting for Imbalanced Data
    Fernandez-Baldera, Antonio
    Buenaposada, Jose M.
    Baumela, Luis
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 57 - 64
  • [5] Extreme minority class detection in imbalanced data for network intrusion
    Milosevic, Marija S.
    Ciric, Vladimir M.
    COMPUTERS & SECURITY, 2022, 123
  • [6] Multi-class WHMBoost: An ensemble algorithm for multi-class imbalanced data
    Zhao, Jiakun
    Jin, Ju
    Zhang, Yibo
    Zhang, Ruifeng
    Chen, Si
    INTELLIGENT DATA ANALYSIS, 2022, 26 (03) : 599 - 614
  • [7] Study of Multi-Class Classification Algorithms' Performance on Highly Imbalanced Network Intrusion Datasets
    Bulavas, Viktoras
    Marcinkevicius, Virginijus
    Ruminski, Jacek
    INFORMATICA, 2021, 32 (03) : 441 - 475
  • [8] Concept Drift Detection from Multi-Class Imbalanced Data Streams
    Korycki, Lukasz
    Krawczyk, Bartosz
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021 : 1068 - 1079
  • [9] Evaluating Difficulty of Multi-class Imbalanced Data
    Lango, Mateusz
    Napierala, Krystyna
    Stefanowski, Jerzy
    FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2017, 2017, 10352 : 312 - 322
  • [10] Two Layers Multi-class Detection Method for Network Intrusion Detection System
    Yuan, Yali
    Huo, Liuwei
    Hogrefe, Dieter
    2017 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2017 : 767 - 772