A Federated Learning Approach for Anomaly Detection in High Performance Computing

被引:4
|
作者
Farooq, Emmen [1 ]
Borghesi, Andrea [1 ]
机构
[1] Univ Bologna, DISI, Bologna, Italy
关键词
Federated Learning; High Performance Computing; Anomaly Detection; Machine Learning;
D O I
10.1109/ICTAI59109.2023.00079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High Performance Computing (HPC) systems are complex machines that need to be operated at their maximum potential to recoup their investment cost and to mitigate their environmental impact. Anomalous conditions hindering the correct usage of the supercomputing nodes are a significant problem. Hence, the development of automated anomaly detection techniques remains a vital area of research. Machine Learning (ML) models demonstrated to be good at detecting anomalies on individual nodes. However, the potential of combining data from multiple computing nodes and associated ML models has not been explored yet. Federated Learning (FL) can address this shortcoming, by allowing individual models to learn from each other. This paper applies FL to improve the performance of anomaly detection models for HPC systems. The approach has been validated on data from an actual supercomputer, obtaining an improvement in the average f-score from 0.31 to 0.84. We also show how FL can significantly shorten the data collection period needed to create a training set. While ML models need, on average, 4.5 months of training data, FL reduces the training set size to 1.2 weeks - a 15x reduction.
引用
收藏
页码:496 / 500
页数:5
相关论文
共 50 条
  • [11] Network Anomaly Detection Using Federated Learning
    Marfo, William
    Tosh, Deepak K.
    Moore, Shirley V.
    2022 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM), 2022,
  • [12] Anomaly Detection using Distributed Log Data: A Lightweight Federated Learning Approach
    Guo, Yalan
    Wu, Yulei
    Zhu, Yanchao
    Yang, Bingqiang
    Han, Chunjing
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [13] Distributed Anomaly Detection in Smart Grids: A Federated Learning-Based Approach
    Jithish, J.
    Alangot, Bithin
    Mahalingam, Nagarajan
    Yeo, Kiat Seng
    IEEE ACCESS, 2023, 11 : 7157 - 7179
  • [14] A Federated Learning Approach for Efficient Anomaly Detection in Electric Power Steering Systems
    Kea, Kimleang
    Han, Youngsun
    Min, Young-Jae
    IEEE ACCESS, 2024, 12 : 67525 - 67536
  • [15] Anomaly Detection Using Autoencoders in High Performance Computing Systems
    Borghesi, Andrea
    Bartolini, Andrea
    Lombardi, Michele
    Milano, Michela
    Benini, Luca
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9428 - 9433
  • [16] Intrusion Detection Using Federated Learning for Computing
    Aashmi R.S.
    Jaya T.
    Computer Systems Science and Engineering, 2023, 45 (02): : 1295 - 1308
  • [17] Detecting cyberattacks using anomaly detection in industrial control systems: A Federated Learning approach
    Huong, Truong Thu
    Bac, Ta Phuong
    Long, Dao Minh
    Luong, Tran Duc
    Dan, Nguyen Minh
    Quang, Le Anh
    Cong, Le Thanh
    Thang, Bui Doan
    Tran, Kim Phuc
    COMPUTERS IN INDUSTRY, 2021, 132 (132)
  • [18] Enhancing Robustness in Federated Learning by Supervised Anomaly Detection
    Quan, Pengrui
    Lee, Wei-Han
    Srivatsa, Mudhakar
    Srivastava, Mani
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 996 - 1003
  • [19] Harnessing federated learning for anomaly detection in supercomputer nodes
    Farooq, Emmen
    Milano, Michela
    Borghesi, Andrea
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 161 : 673 - 685
  • [20] Federated deep learning for anomaly detection in the internet of things
    Wang, Xiaofeng
    Wang, Yonghong
    Javaheri, Zahra
    Almutairi, Laila
    Moghadamnejad, Navid
    Younes, Osama S.
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 108