Data-Efficient, Federated Learning for Raw Network Traffic Detection

被引:0
|
作者
Willeke, Mikal R. [1 ,2 ]
Bierbrauer, David A. [2 ]
Bastian, Nathaniel D. [1 ,2 ]
机构
[1] US Mil Acad, Dept Syst Engn, West Point, NY 10996 USA
[2] US Mil Acad, Army Cyber Inst, West Point, NY 10996 USA
关键词
Federated Learning; Network Intrusion Detection; Internet of Battlefield Things; Data-efficiency; INTRUSION DETECTION; THINGS;
D O I
10.1117/12.2663092
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional machine learning (ML) models used for enterprise network intrusion detection systems (NIDS) typically rely on vast amounts of centralized data with expertly engineered features. Previous work, however, has shown the feasibility of using deep learning (DL) to detect malicious activity on raw network traffic payloads rather than engineered features at the edge, which is necessary for tactical military environments. In the future Internet of Battlefield Things (IoBT), the military will find itself in multiple environments with disconnected networks spread across the battlefield. These resource-constrained, data-limited networks require distributed and collaborative ML/DL models for inference that are continually trained both locally, using data from each separate tactical edge network, and then globally in order to learn and detect malicious activity represented across the multiple networks in a collaborative fashion. Federated Learning (FL), a collaborative paradigm which updates and distributes a global model through local model weight aggregation, provides a solution to train ML/DL models in NIDS utilizing learning from multiple edge devices from the disparate networks without the sharing of raw data. We develop and experiment with a data-efficient, FL framework for IoBT settings for intrusion detection using only raw network traffic in restricted, resource-limited environments. Our results indicate that regardless of the DL model architecture used on edge devices, the Federated Averaging FL algorithm achieved over 93% accuracy in model performance in detecting malicious payloads after only five episodes of FL training.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Data-efficient performance learning for configurable systems
    Guo, Jianmei
    Yang, Dingyu
    Siegmund, Norbert
    Apel, Sven
    Sarkar, Atrisha
    Valov, Pavel
    Czarnecki, Krzysztof
    Wasowski, Andrzej
    Yu, Huiqun
    EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (03) : 1826 - 1867
  • [32] Elliptic PDE learning is provably data-efficient
    Boulle, Nicolas
    Halikias, Diana
    Townsend, Alex
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (39)
  • [33] DATA-EFFICIENT MINIMAX QUICKEST CHANGE DETECTION
    Banerjee, Taposh
    Veeravalli, Venugopal V.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 3937 - 3940
  • [34] Decentralized Data-Efficient Quickest Change Detection
    Banerjee, Taposh
    Veeravalli, Venugopal V.
    Tartakovsky, Alexander
    2013 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2013, : 2587 - +
  • [35] Data-Efficient and Interpretable Tabular Anomaly Detection
    Chang, Chun-Hao
    Yoon, Jinsung
    Arik, Sercan O.
    Udell, Madeleine
    Pfister, Tomas
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 190 - 201
  • [36] Anomaly Traffic Detection Based on Communication-Efficient Federated Learning in Space-Air-Ground Integration Network
    Xu, Haitao
    Han, Shuying
    Li, Xuhui
    Han, Zhu
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9346 - 9360
  • [37] Anomaly Traffic Detection with Federated Learning toward Network-based Malware Detection in IoT
    Nishio, Takayuki
    Nakahara, Masataka
    Okui, Norihiro
    Kubota, Ayumu
    Kobayashi, Yasuaki
    Sugiyama, Keizo
    Shinkuma, Ryoichi
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 299 - 304
  • [38] Towards Adversarially Robust Data-Efficient Learning with Generated Data
    Do, Junhao
    Wo, Melvin
    Xia, Sihan
    Tay, Wei En
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1422 - 1424
  • [39] Lossy Compression of Noisy Data for Private and Data-Efficient Learning
    Isik B.
    Weissman T.
    IEEE Journal on Selected Areas in Information Theory, 2022, 3 (04): : 815 - 823
  • [40] Data-to-Model Distillation: Data-Efficient Learning Framework
    Sajedi, Ahmad
    Khaki, Samir
    Liu, Lucy Z.
    Amjadian, Ehsan
    Lawryshyn, Yuri A.
    Plataniotis, Konstantinos N.
    COMPUTER VISION-ECCV 2024, PT XLIII, 2025, 15101 : 438 - 457