A Federated Learning Approach for Anomaly Detection in High Performance Computing

被引:4
|
作者
Farooq, Emmen [1 ]
Borghesi, Andrea [1 ]
机构
[1] Univ Bologna, DISI, Bologna, Italy
关键词
Federated Learning; High Performance Computing; Anomaly Detection; Machine Learning;
D O I
10.1109/ICTAI59109.2023.00079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High Performance Computing (HPC) systems are complex machines that need to be operated at their maximum potential to recoup their investment cost and to mitigate their environmental impact. Anomalous conditions hindering the correct usage of the supercomputing nodes are a significant problem. Hence, the development of automated anomaly detection techniques remains a vital area of research. Machine Learning (ML) models demonstrated to be good at detecting anomalies on individual nodes. However, the potential of combining data from multiple computing nodes and associated ML models has not been explored yet. Federated Learning (FL) can address this shortcoming, by allowing individual models to learn from each other. This paper applies FL to improve the performance of anomaly detection models for HPC systems. The approach has been validated on data from an actual supercomputer, obtaining an improvement in the average f-score from 0.31 to 0.84. We also show how FL can significantly shorten the data collection period needed to create a training set. While ML models need, on average, 4.5 months of training data, FL reduces the training set size to 1.2 weeks - a 15x reduction.
引用
收藏
页码:496 / 500
页数:5
相关论文
共 50 条
  • [41] Anomaly Detection Fog (ADF): A federated approach for internet of things
    Behniafar, M.
    Mahjur, A.
    Nowroozi, A.
    SCIENTIA IRANICA, 2023, 30 (02) : 465 - 476
  • [42] Markov Chain Modeling for Anomaly Detection in High Performance Computing System Logs
    Haque, Abida
    DeLucia, Alexandra
    Baseman, Elisabeth
    HUST'17: PROCEEDINGS OF THE FOURTH INTERNATIONAL WORKSHOP ON HPC USER SUPPORT TOOLS, 2017,
  • [43] On the Dynamics of Non-IID Data in Federated Learning and High-Performance Computing
    Annunziata, Daniela
    Canzaniello, Marzia
    Chiaro, Diletta
    Izzo, Stefano
    Savoia, Martina
    Piccialli, Francesco
    2024 32ND EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PDP 2024, 2024, : 230 - 237
  • [44] Chained Anomaly Detection Models for Federated Learning: An Intrusion Detection Case Study
    Preuveneers, Davy
    Rimmer, Vera
    Tsingenopoulos, Ilias
    Spooren, Jan
    Joosen, Wouter
    Ilie-Zudor, Elisabeth
    APPLIED SCIENCES-BASEL, 2018, 8 (12):
  • [45] A proficient approach for face detection and recognition using machine learning and high-performance computing
    Singh, Astha
    Prakash, Shiv
    Kumar, Ankit
    Kumar, Divya
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (03):
  • [46] AI-Empowered Trajectory Anomaly Detection for Intelligent Transportation Systems: A Hierarchical Federated Learning Approach
    Wang, Xiaoding
    Liu, Wenxin
    Lin, Hui
    Hu, Jia
    Kaur, Kuljeet
    Hossain, M. Shamim
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (04) : 4631 - 4640
  • [47] Unified ensemble federated learning with cloud computing for online anomaly detection in energy-efficient wireless sensor networks
    Gayathri, S.
    Surendran, D.
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2024, 13 (01):
  • [48] Unified ensemble federated learning with cloud computing for online anomaly detection in energy-efficient wireless sensor networks
    S. Gayathri
    D. Surendran
    Journal of Cloud Computing, 13
  • [49] High Dimensional Computing Approach to Detection and Learning Gesture Biometrics
    Liu, Eric
    Casey, William
    Melaragno, Anthony
    INTELLIGENT COMPUTING, VOL 4, 2024, 2024, 1019 : 551 - 565
  • [50] Anomaly Detection from Distributed Data Sources via Federated Learning
    Cavallin, Florencia
    Mayer, Rudolf
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, AINA-2022, VOL 2, 2022, 450 : 317 - 328