Investigating rarity in web attacks with ensemble learners

被引:0
|
作者
Richard Zuech
John Hancock
Taghi M. Khoshgoftaar
机构
[1] Florida Atlantic University,
来源
关键词
Rarity; CSE-CIC-IDS2018; Intrusion detection; Web attacks; Class imbalance; Random undersampling; Big data; Ensemble learners;
D O I
暂无
中图分类号
学科分类号
摘要
Class rarity is a frequent challenge in cybersecurity. Rarity occurs when the positive (attack) class only has a small number of instances for machine learning classifiers to train upon, thus making it difficult for the classifiers to discriminate and learn from the positive class. To investigate rarity, we examine three individual web attacks in big data from the CSE-CIC-IDS2018 dataset: “Brute Force-Web”, “Brute Force-XSS”, and “SQL Injection”. These three individual web attacks are also severely imbalanced, and so we evaluate whether random undersampling (RUS) treatments can improve the classification performance for these three individual web attacks. The following eight different levels of RUS ratios are evaluated: no sampling, 999:1, 99:1, 95:5, 9:1, 3:1, 65:35, and 1:1. For measuring classification performance, Area Under the Receiver Operating Characteristic Curve (AUC) metrics are obtained for the following seven different classifiers: Random Forest (RF), CatBoost (CB), LightGBM (LGB), XGBoost (XGB), Decision Tree (DT), Naive Bayes (NB), and Logistic Regression (LR) (with the first four learners being ensemble learners and for comparison, the last three being single learners). We find that applying random undersampling does improve overall classification performance with the AUC metric in a statistically significant manner. Ensemble learners achieve the top AUC scores after massive undersampling is applied, but the ensemble learners break down and have poor performance (worse than NB and DT) when no sampling is applied to our unique and harsh experimental conditions of severe class imbalance and rarity.
引用
收藏
相关论文
共 50 条
  • [21] A taxonomy of web attacks
    Alvarez, G
    Petrovic, S
    [J]. WEB ENGINEERING, PROCEEDINGS, 2003, 2722 : 295 - 298
  • [22] Efficient ensemble to combat flash attacks
    Kumar, Om C. U.
    Bhama, Ponsy R. K. Sathia
    [J]. COMPUTATIONAL INTELLIGENCE, 2024, 40 (01)
  • [23] Ensemble Methods to Detect XSS Attacks
    Nagarjun, P. M. D.
    Ahamad, Shaik Shakeel
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 695 - 700
  • [24] Investigating the Writing Achievement of Deaf Learners
    Mayer, Connie
    Trezek, Beverly J.
    [J]. AMERICAN ANNALS OF THE DEAF, 2023, 167 (05) : 625 - 643
  • [25] INVESTIGATING THE USE OF FACEBOOK BY DEAF LEARNERS
    Goodoory, K.
    [J]. EDULEARN15: 7TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2015, : 1980 - 1988
  • [26] Investigating feedback orientation in medical learners
    Mills, Lynnea M.
    O'Sullivan, Patricia S.
    ten Cate, Olle
    Boscardin, Christy
    [J]. MEDICAL TEACHER, 2023, 45 (05) : 492 - 498
  • [27] Web Services for Supporting the Interactions of Learners in the Social Web
    Rebedea, Traian
    Dascalu, Mihai
    Posea, Vlad
    Trausan-Matu, Stefan
    [J]. 9TH ROEDUNET IEEE INTERNATIONAL CONFERENCE, 2010, : 128 - 133
  • [28] EVALUATION OF ROBUSTNESS OF ENSEMBLE LEARNERS TO NOISY DATA
    Albayrak, Abdulkadir
    Cingiz, M. Ozgur
    Amasyali, M. Fatih
    [J]. 2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [29] An ensemble technique for stable learners with performance bounds
    Davidson, I
    [J]. PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 330 - 335
  • [30] Detecting IoT Botnet Attacks through Ensemble and Meta Ensemble Approaches
    Ma, Xiangjun
    He, Jingsha
    Nazir, Ahsan
    Zhu, Nafei
    Hu, Xiao
    Ullah, Faheem
    Wajahat, Ahsan
    Luo, Yehong
    Qureshi, Sirajuddin
    [J]. International Journal of Network Security, 2024, 26 (05): : 885 - 900