A Federated Learning Approach for Anomaly Detection in High Performance Computing

Cited by: 4
Authors
Farooq, Emmen [1]
Borghesi, Andrea [1]
Affiliations
[1] Univ Bologna, DISI, Bologna, Italy
Keywords
Federated Learning; High Performance Computing; Anomaly Detection; Machine Learning;
DOI
10.1109/ICTAI59109.2023.00079
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
High Performance Computing (HPC) systems are complex machines that need to be operated at their maximum potential to recoup their investment cost and to mitigate their environmental impact. Anomalous conditions that hinder the correct usage of supercomputing nodes are a significant problem, so the development of automated anomaly detection techniques remains a vital area of research. Machine Learning (ML) models have proven effective at detecting anomalies on individual nodes. However, the potential of combining data from multiple computing nodes and their associated ML models has not yet been explored. Federated Learning (FL) can address this shortcoming by allowing individual models to learn from each other. This paper applies FL to improve the performance of anomaly detection models for HPC systems. The approach has been validated on data from an actual supercomputer, improving the average F-score from 0.31 to 0.84. We also show how FL can significantly shorten the data collection period needed to create a training set. While ML models need, on average, 4.5 months of training data, FL reduces the required training data to 1.2 weeks, a 15x reduction.
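The abstract does not say which aggregation rule the authors use. As a purely illustrative sketch of how per-node models can "learn from each other", the following shows weighted parameter averaging in the style of federated averaging (FedAvg). The function name `federated_average`, the flat parameter lists, and the sample counts are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def federated_average(client_weights: Sequence[Sequence[float]],
                      client_sizes: Sequence[int]) -> List[float]:
    """Average per-node model parameters, weighted by local sample count.

    Illustrative FedAvg-style aggregation; not the paper's confirmed method.
    """
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one sample count per client model")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")
    dim = len(client_weights[0])
    if any(len(w) != dim for w in client_weights):
        raise ValueError("all client models must have the same shape")
    # Each global parameter is the sample-weighted mean of the local ones.
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]


# Hypothetical parameters from three compute nodes (two parameters each)
# and the number of local training samples held by each node.
node_weights = [[0.0, 2.0], [1.0, 4.0], [4.0, 0.0]]
node_sizes = [100, 100, 200]

global_weights = federated_average(node_weights, node_sizes)
print(global_weights)  # [2.25, 1.5]
```

In a full FL round, each node would train locally, send only its parameters to an aggregator, and receive the averaged model back, so raw node data never leaves the node.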
Pages: 496-500
Page count: 5
Related papers
50 in total
  • [21] Federated Learning for Anomaly Detection in Maritime Movement Data
    Graser, Anita
    Weissenfeld, Axel
    Heistracher, Clemens
    Dragaschnig, Melitta
    Widhalm, Peter
    PROCEEDINGS OF THE 2024 25TH IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT, MDM 2024, 2024, : 77 - 82
  • [22] Support Vector Based Anomaly Detection in Federated Learning
    Frasson, Massimo
    Malchiodi, Dario
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2024, 2024, 2141 : 274 - 287
  • [23] Federated Anomaly Detection
    Zhang, Chunjiong
    Roh, Byeong-hee
    Shan, Gaoyang
    2024 54TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS-SUPPLEMENTAL VOLUME, DSN-S 2024, 2024, : 148 - 149
  • [24] Benchmarking Federated Learning on High-Performance Computing: Aggregation Methods and Their Impact
    Annunziata, Daniela
    Canzaniello, Marzia
    Savoia, Martina
    Cuomo, Salvatore
    Piccialli, Francesco
    2024 32ND EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, PDP 2024, 2024, : 207 - 214
  • [25] A Semi-Supervised Learning Approach for Network Anomaly Detection in Fog Computing
    Xu, Shengjie
    Qian, Yi
    Hu, Rose Qingyang
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [26] A Federated Learning Approach to Pneumonia Detection
    Khan, Saadat Hasan
    Alam, Md Golam Rabiul
    2021 7TH INTERNATIONAL CONFERENCE ON ENGINEERING AND EMERGING TECHNOLOGIES (ICEET 2021), 2021, : 66 - 71
  • [27] Anomaly detection in log-event sequences: A federated deep learning approach and open challenges
    Himler, Patrick
    Landauer, Max
    Skopik, Florian
    Wurzenberger, Markus
    MACHINE LEARNING WITH APPLICATIONS, 2024, 16
  • [28] Differentially Private Federated Learning for Anomaly Detection in eHealth Networks
    Cholakoska, Ana
    Pfitzner, Bjarne
    Gjoreski, Hristijan
    Rakovic, Valentin
    Arnrich, Bert
    Kalendar, Marija
    UBICOMP/ISWC '21 ADJUNCT: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2021 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2021, : 514 - 518
  • [29] Anomaly detection and defense techniques in federated learning: a comprehensive review
    Zhang, Chang
    Yang, Shunkun
    Mao, Lingfeng
    Ning, Huansheng
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (06)
  • [30] Collaborative Anomaly Detection for Internet of Things based on Federated Learning
    Kim, Seongwoo
    Cai, He
    Hua, Cunqing
    Gu, Pengwenlong
    Xu, Wenchao
    Park, Jeonghyeok
    2020 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2020, : 623 - 628