Semisupervised Learning with Data Augmentation for Raw Network Traffic Detection

被引:0
|
作者
Bhoo, Robin C. [1 ]
Bastian, Nathaniel D. [2 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA USA
[2] US Mil Acad, Army Cyber Inst, West Point, NY 10996 USA
关键词
Deep Learning; Semi-supervised Learning; Data Augmentation; Network Intrusion Detection;
D O I
10.1117/12.3013183
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning (DL) has revolutionized machine learning tasks in various domains, but conventional DL methods often demand substantial amounts of labeled data. Semi-supervised learning (SSL) provides an effective solution by incorporating unlabeled data, offering s ignificant advantages in terms of cost and data accessibility. While DL has shown promise with its integration as a component of modern network intrusion detection systems (NIDS), the majority of research in this field f ocuses o n f ully s upervised l earning. However, more recent SSL algorithms leveraging data augmentations do not perform optimally "out of the box" due to the absence of suitable augmentation schemes for packet-level network traffic da ta. Th rough th e in troduction of a novel data augmentation scheme tailored to packet-level network traffic datasets, this paper presents a comprehensive analysis of multiple SSL algorithms for multi-class network traffic detection in a few-shot learning sc enario. We find t hat e ven r elatively s imple a pproaches l ike vanilla p seudo-labeling c an a chieve a n F1-Score t hat i s within 5% of fully supervised learning methods while utilizing less than 2% of the labeled data.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Data-Efficient, Federated Learning for Raw Network Traffic Detection
    Willeke, Mikal R.
    Bierbrauer, David A.
    Bastian, Nathaniel D.
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
  • [2] Transfer learning for raw network traffic detection
    Bierbrauer, David A.
    De Lucia, Michael J.
    Reddy, Krishna
    Maxwell, Paul
    Bastian, Nathaniel D.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 211
  • [3] Machine Learning for Raw Network Traffic Detection
    De Lucia, Michael
    Maxwell, Paul E.
    Bastian, Nathaniel D.
    Swami, Ananthram
    Jalaian, Brian
    Leslie, Nandi
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS III, 2021, 11746
  • [4] A Deep Learning Approach for Botnet Detection Using Raw Network Traffic Data
    Mohaddeseh Shahhosseini
    Hoda Mashayekhi
    Mohsen Rezvani
    [J]. Journal of Network and Systems Management, 2022, 30
  • [5] A Deep Learning Approach for Botnet Detection Using Raw Network Traffic Data
    Shahhosseini, Mohaddeseh
    Mashayekhi, Hoda
    Rezvani, Mohsen
    [J]. JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2022, 30 (03)
  • [6] Data Anomaly Detection through Semisupervised Learning Aided by Customised Data Augmentation Techniques
    Wang, Xiaoyou
    Du, Yao
    Zhou, Xiaoqing
    Xia, Yong
    [J]. STRUCTURAL CONTROL & HEALTH MONITORING, 2023, 2023
  • [7] Zero-Shot Learning for Raw Network Traffic Detection
    Rani, Pooja
    Bastian, Nathaniel D.
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
  • [8] Vision-Based Detection Method for Construction Site Monitoring by Integrating Data Augmentation and Semisupervised Learning
    Shi, Mengnan
    Chen, Chen
    Xiao, Bo
    Seo, JoonOh
    [J]. JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2024, 150 (05)
  • [9] A Learning-based Data Augmentation for Network Anomaly Detection
    Al Olaimat, Mohammad
    Lee, Dongeun
    Kim, Youngsoo
    Kim, Jonghyun
    Kim, Jinoh
    [J]. 2020 29TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2020), 2020,
  • [10] RawPower: Deep Learning based Anomaly Detection from Raw Network Traffic Measurements
    Marin, Gonzalo
    Casas, Pedro
    Capdehourat, German
    [J]. SIGCOMM'18: PROCEEDINGS OF THE ACM SIGCOMM 2018 CONFERENCE: POSTERS AND DEMOS, 2018, : 75 - 77