Data-Efficient, Federated Learning for Raw Network Traffic Detection

被引：0

作者：

Willeke, Mikal R. ^{[1
,2
]}

Bierbrauer, David A. ^{[2
]}

Bastian, Nathaniel D. ^{[1
,2
]}

机构：

[1] US Mil Acad, Dept Syst Engn, West Point, NY 10996 USA

[2] US Mil Acad, Army Cyber Inst, West Point, NY 10996 USA

来源：

ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V | 2023年 / 12538卷

关键词：

Federated Learning; Network Intrusion Detection; Internet of Battlefield Things; Data-efficiency; INTRUSION DETECTION; THINGS;

D O I：

10.1117/12.2663092

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Traditional machine learning (ML) models used for enterprise network intrusion detection systems (NIDS) typically rely on vast amounts of centralized data with expertly engineered features. Previous work, however, has shown the feasibility of using deep learning (DL) to detect malicious activity on raw network traffic payloads rather than engineered features at the edge, which is necessary for tactical military environments. In the future Internet of Battlefield Things (IoBT), the military will find itself in multiple environments with disconnected networks spread across the battlefield. These resource-constrained, data-limited networks require distributed and collaborative ML/DL models for inference that are continually trained both locally, using data from each separate tactical edge network, and then globally in order to learn and detect malicious activity represented across the multiple networks in a collaborative fashion. Federated Learning (FL), a collaborative paradigm which updates and distributes a global model through local model weight aggregation, provides a solution to train ML/DL models in NIDS utilizing learning from multiple edge devices from the disparate networks without the sharing of raw data. We develop and experiment with a data-efficient, FL framework for IoBT settings for intrusion detection using only raw network traffic in restricted, resource-limited environments. Our results indicate that regardless of the DL model architecture used on edge devices, the Federated Averaging FL algorithm achieved over 93% accuracy in model performance in detecting malicious payloads after only five episodes of FL training.

引用

页数：16

共 50 条

[31] Data-efficient performance learning for configurable systems
Guo, Jianmei
Yang, Dingyu
Siegmund, Norbert
Apel, Sven
Sarkar, Atrisha
Valov, Pavel
Czarnecki, Krzysztof
Wasowski, Andrzej
Yu, Huiqun
EMPIRICAL SOFTWARE ENGINEERING, 2018, 23 (03) : 1826 - 1867
[32] Elliptic PDE learning is provably data-efficient
Boulle, Nicolas
Halikias, Diana
Townsend, Alex
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (39)
[33] DATA-EFFICIENT MINIMAX QUICKEST CHANGE DETECTION
Banerjee, Taposh
Veeravalli, Venugopal V.
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 3937 - 3940
[34] Decentralized Data-Efficient Quickest Change Detection
Banerjee, Taposh
Veeravalli, Venugopal V.
Tartakovsky, Alexander
2013 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2013, : 2587 - +
[35] Data-Efficient and Interpretable Tabular Anomaly Detection
Chang, Chun-Hao
Yoon, Jinsung
Arik, Sercan O.
Udell, Madeleine
Pfister, Tomas
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 190 - 201
[36] Anomaly Traffic Detection Based on Communication-Efficient Federated Learning in Space-Air-Ground Integration Network
Xu, Haitao
Han, Shuying
Li, Xuhui
Han, Zhu
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9346 - 9360
[37] Anomaly Traffic Detection with Federated Learning toward Network-based Malware Detection in IoT
Nishio, Takayuki
Nakahara, Masataka
Okui, Norihiro
Kubota, Ayumu
Kobayashi, Yasuaki
Sugiyama, Keizo
Shinkuma, Ryoichi
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 299 - 304
[38] Towards Adversarially Robust Data-Efficient Learning with Generated Data
Do, Junhao
Wo, Melvin
Xia, Sihan
Tay, Wei En
2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1422 - 1424
[39] Lossy Compression of Noisy Data for Private and Data-Efficient Learning
Isik B.
Weissman T.
IEEE Journal on Selected Areas in Information Theory, 2022, 3 (04): : 815 - 823
[40] Data-to-Model Distillation: Data-Efficient Learning Framework
Sajedi, Ahmad
Khaki, Samir
Liu, Lucy Z.
Amjadian, Ehsan
Lawryshyn, Yuri A.
Plataniotis, Konstantinos N.
COMPUTER VISION-ECCV 2024, PT XLIII, 2025, 15101 : 438 - 457

← 1 2 3 4 5 →