Multicenter Hierarchical Federated Learning With Fault-Tolerance Mechanisms for Resilient Edge Computing Networks

被引:4
|
作者
Chen, Xiaohong [1 ]
Xu, Guanying [1 ]
Xu, Xuesong [2 ]
Jiang, Haichong [3 ]
Tian, Zhiping [3 ]
Ma, Tao [4 ]
机构
[1] Cent South Univ, Xiang Jiang Lab, Business Sch, Changsha 410083, Peoples R China
[2] Hunan Univ Technol & Business, Changsha Social Lab Artificial Intelligence, Changsha 410205, Peoples R China
[3] Hunan Univ Technol & Business, Sch Adv Interdisciplinary Studies, Changsha 410205, Peoples R China
[4] Hope Innovat Co Ltd, Changsha 410205, Peoples R China
基金
中国国家自然科学基金;
关键词
Servers; Training; Computational modeling; Computer architecture; Fault tolerant systems; Fault tolerance; Real-time systems; federated learning (FL); hierarchical FL (HFL); multicenter; STOCHASTIC GRADIENT DESCENT; RESOURCE-ALLOCATION; INTELLIGENCE;
D O I
10.1109/TNNLS.2024.3362974
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the realm of federated learning (FL), the conventional dual-layered architecture, comprising a central parameter server and peripheral devices, often encounters challenges due to its significant reliance on the central server for communication and security. This dependence becomes particularly problematic in scenarios involving potential malfunctions of devices and servers. While existing device-edge-cloud hierarchical FL (HFL) models alleviate some dependence on central servers and reduce communication overheads, they primarily focus on load balancing within edge computing networks and fall short of achieving complete decentralization and edge-centric model aggregation. Addressing these limitations, we introduce the multicenter HFL (MCHFL) framework. This innovative framework replaces the traditional single central server architecture with a distributed network of robust global aggregation centers located at the edge, inherently enhancing fault tolerance crucial for maintaining operational integrity amidst edge network disruptions. Our comprehensive experiments with the MNIST, FashionMNIST, and CIFAR-10 datasets demonstrate the MCHFL's superior performance. Notably, even under high paralysis ratios of up to 50%, the MCHFL maintains high accuracy levels, with maximum accuracy reductions of only 2.60%, 5.12%, and 16.73% on these datasets, respectively. This performance significantly surpasses the notable accuracy declines observed in traditional single-center models under similar conditions. To the best of our knowledge, the MCHFL is the first edge multicenter FL framework with theoretical underpinnings. Our extensive experimental results across various datasets validate the MCHFL's effectiveness, showcasing its higher accuracy, faster convergence speed, and stronger robustness compared to single-center models, thereby establishing it as a pioneering paradigm in edge multicenter FL.
引用
收藏
页码:47 / 61
页数:15
相关论文
共 50 条
  • [41] Efficient federated learning for fault diagnosis in industrial cloud-edge computing
    Wang, Qizhao
    Li, Qing
    Wang, Kai
    Wang, Hong
    Zeng, Peng
    COMPUTING, 2021, 103 (10) : 2319 - 2337
  • [42] METHODS AND MODELS FOR COMPUTING SURVIVABILITY AND FAULT-TOLERANCE OF A NETWORK
    GAGIN, AA
    MICROELECTRONICS AND RELIABILITY, 1993, 33 (10): : 1533 - 1552
  • [43] Fault-tolerance of (n, k)-star networks
    Li, Xiang-Jun
    Xu, Jun-Ming
    APPLIED MATHEMATICS AND COMPUTATION, 2014, 248 : 525 - 530
  • [44] A mutual information based federated learning framework for edge computing networks
    Chen, Naiyue
    Li, Yinglong
    Liu, Xuejun
    Zhang, Zhenjiang
    COMPUTER COMMUNICATIONS, 2021, 176 (176) : 23 - 30
  • [45] Efficient federated learning for fault diagnosis in industrial cloud-edge computing
    Qizhao Wang
    Qing Li
    Kai Wang
    Hong Wang
    Peng Zeng
    Computing, 2021, 103 : 2319 - 2337
  • [46] Fault-tolerance in distance-edge-monitoring sets
    Yang, Chenxu
    Mao, Yaping
    Klasing, Ralf
    Yang, Gang
    Xiao, Yuzhi
    Zhang, Xiaoyan
    ACTA INFORMATICA, 2025, 62 (01)
  • [47] The Structure Fault-Tolerance of Enhanced Hypercube Networks
    Jin, Dan
    Liu, Hong-mei
    2018 INTERNATIONAL CONFERENCE ON ELECTRICAL, CONTROL, AUTOMATION AND ROBOTICS (ECAR 2018), 2018, 307 : 235 - 238
  • [48] ON THE FAULT-TOLERANCE AND SIZE OF WDM OPTICAL NETWORKS
    Ceres, Pietro
    Ceres, Raffaele
    PARALLEL PROCESSING LETTERS, 2011, 21 (01) : 3 - 12
  • [49] FAULT-TOLERANCE IN MULTICHANNEL LOCAL AREA NETWORKS
    CAMARDA, P
    GERLA, M
    EIGHTH ANNUAL INTERNATIONAL PHOENIX CONFERENCE ON COMPUTERS AND COMMUNICATIONS: 1989 CONFERENCE PROCEEDINGS, 1989, : 133 - 137
  • [50] PERFORMANCE AND FAULT-TOLERANCE OF NEURAL NETWORKS FOR OPTIMIZATION
    PROTZEL, PW
    PALUMBO, DL
    ARRAS, MK
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1993, 4 (04): : 600 - 614