A Federated Learning Approach for Anomaly Detection in High Performance Computing

被引:4
|
作者
Farooq, Emmen [1 ]
Borghesi, Andrea [1 ]
机构
[1] Univ Bologna, DISI, Bologna, Italy
关键词
Federated Learning; High Performance Computing; Anomaly Detection; Machine Learning;
D O I
10.1109/ICTAI59109.2023.00079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High Performance Computing (HPC) systems are complex machines that need to be operated at their maximum potential to recoup their investment cost and to mitigate their environmental impact. Anomalous conditions hindering the correct usage of the supercomputing nodes are a significant problem. Hence, the development of automated anomaly detection techniques remains a vital area of research. Machine Learning (ML) models demonstrated to be good at detecting anomalies on individual nodes. However, the potential of combining data from multiple computing nodes and associated ML models has not been explored yet. Federated Learning (FL) can address this shortcoming, by allowing individual models to learn from each other. This paper applies FL to improve the performance of anomaly detection models for HPC systems. The approach has been validated on data from an actual supercomputer, obtaining an improvement in the average f-score from 0.31 to 0.84. We also show how FL can significantly shorten the data collection period needed to create a training set. While ML models need, on average, 4.5 months of training data, FL reduces the training set size to 1.2 weeks - a 15x reduction.
引用
收藏
页码:496 / 500
页数:5
相关论文
共 50 条
  • [31] Identifying Backdoor Attacks in Federated Learning via Anomaly Detection
    Mi, Yuxi
    Sun, Yiheng
    Guan, Jihong
    Zhou, Shuigeng
    WEB AND BIG DATA, PT III, APWEB-WAIM 2023, 2024, 14333 : 111 - 126
  • [32] Trust-Based Anomaly Detection in Federated Edge Learning
    Zatsarenko, Raman
    Chuprov, Sergei
    Korobeinikov, Dmitrii
    Reznik, Leon
    2024 IEEE 5TH ANNUAL WORLD AI IOT CONGRESS, AIIOT 2024, 2024, : 0273 - 0279
  • [33] Federated disentangled representation learning for unsupervised brain anomaly detection
    Cosmin I. Bercea
    Benedikt Wiestler
    Daniel Rueckert
    Shadi Albarqouni
    Nature Machine Intelligence, 2022, 4 : 685 - 695
  • [34] Personalized federated learning framework for network traffic anomaly detection
    Pei, Jiaming
    Zhong, Kaiyang
    Jan, Mian Ahmad
    Li, Jinhai
    COMPUTER NETWORKS, 2022, 209
  • [35] Federated Learning with Anomaly Client Detection and Decentralized Parameter Aggregation
    Shu Liu
    Shang, Yanlei
    52ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOP VOLUME (DSN-W 2022), 2022, : 37 - 43
  • [36] Federated Variational Learning for Anomaly Detection in Multivariate Time Series
    Zhang, Kai
    Jiang, Yushan
    Seversky, Lee
    Xu, Chengtao
    Liu, Dahai
    Song, Houbing
    2021 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE (IPCCC), 2021,
  • [37] POSTER: Decentralized Federated Learning for Internet of Things Anomaly Detection
    Lian, Zhuotao
    Su, Chunhua
    ASIA CCS'22: PROCEEDINGS OF THE 2022 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2022, : 1249 - 1251
  • [38] Trust-based federated learning for network anomaly detection
    Chen, Naiyue
    Jin, Yi
    Li, Yinglong
    Cai, Luxin
    WEB INTELLIGENCE, 2021, 19 (04) : 317 - 327
  • [39] Federated disentangled representation learning for unsupervised brain anomaly detection
    Bercea, Cosmin, I
    Wiestler, Benedikt
    Rueckert, Daniel
    Albarqouni, Shadi
    NATURE MACHINE INTELLIGENCE, 2022, 4 (08) : 685 - +
  • [40] Performance Analysis of Federated Learning in Orbital Edge Computing
    Jabbarpour, Mohammad Reza
    Javadi, Bahman
    Leong, Philip H. W.
    Calheiros, Rodrigo N.
    Boland, David
    Butler, Chris
    16TH IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC 2023, 2023,