In-Network Computation for Large-Scale Federated Learning Over Wireless Edge Networks

被引:11
|
作者
Dinh, Thinh Quang [1 ]
Nguyen, Diep N. [1 ]
Hoang, Dinh Thai [1 ]
Pham, Tran Vu [2 ]
Dutkiewicz, Eryk [1 ]
机构
[1] Univ Technol Sydney, Sch Elect & Data Engn, Ultimo, NSW 2007, Australia
[2] Ho Chi Minh City Univ Technol HCMUT, VNU HCM, Ho Chi Minh City 70000, Vietnam
基金
澳大利亚研究理事会;
关键词
Computational modeling; Servers; Routing; Training; Network architecture; Machine learning; Stars; Mobile edge computing; federated learning; in-network computation; large-scale distributed learning;
D O I
10.1109/TMC.2022.3190260
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most conventional Federated Learning (FL) models are using a star network topology where all users aggregate their local models at a single server (e.g., a cloud server). That causes significant overhead in terms of both communications and computing at the server, delaying the training process, especially for large scale FL systems with straggling nodes. This article proposes a novel edge network architecture that enables decentralizing the model aggregation process at the server, thereby significantly reducing the training delay for the whole FL network. Specifically, we design a highly-effective in-network computation framework (INC) consisting of a user scheduling mechanism, an in-network aggregation process (INA) which is designed for both primal- and primal-dual methods in distributed machine learning problems, and a network routing algorithm with theoretical performance bounds. The in-network aggregation process, which is implemented at edge nodes and cloud node, can adapt two typical methods to allow edge networks to effectively solve the distributed machine learning problems. Under the proposed INA, we then formulate a joint routing and resource optimization problem, aiming to minimize the aggregation latency. The problem turns out to be NP-hard, and thus we propose a polynomial time routing algorithm which can achieve near optimal performance with a theoretical bound. Simulation results showed that the proposed algorithm can achieve more than 99% of the optimal solution and reduce the FL training latency, up to 5.6 times w.r.t other baselines. The proposed INC framework can not only help reduce the FL training latency but also significantly decrease cloud's traffic and computing overhead. By embedding the computing/aggregation tasks at the edge nodes and leveraging the multi-layer edge-network architecture, the INC framework can liberate FL from the star topology to enable large-scale FL.
引用
收藏
页码:5918 / 5932
页数:15
相关论文
共 50 条
  • [41] Federated Learning Over Wireless Networks: Challenges and Solutions
    Beitollahi, Mahdi
    Lu, Ning
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (16) : 14749 - 14763
  • [42] Distributed network configuration in large-scale low power wireless networks
    Kim, Hyung-Sin
    Bang, Jae-Seok
    Lee, Yong-Hwan
    COMPUTER NETWORKS, 2014, 70 : 288 - 301
  • [43] Asynchronous Federated Learning Over Wireless Communication Networks
    Wang, Zhongyu
    Zhang, Zhaoyang
    Tian, Yuqing
    Yang, Qianqian
    Shan, Hangguan
    Wang, Wei
    Quek, Tony Q. S.
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (09) : 6961 - 6978
  • [44] Low Latency Federated Learning over Wireless Edge Networks via Efficient Bandwidth Allocation
    Kushwaha, Deepali
    Redhu, Surender
    Hegde, Rajesh M.
    2022 IEEE 8TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2022,
  • [45] Debiased Device Sampling for Federated Edge Learning in Wireless Networks
    Chen, Siguang
    Li, Qun
    Shi, Yanhang
    Li, Xue
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (02) : 709 - 721
  • [46] Intrusion Detection for Wireless Edge Networks Based on Federated Learning
    Chen, Zhuo
    Lv, Na
    Liu, Pengfei
    Fang, Yu
    Chen, Kun
    Pan, Wu
    IEEE ACCESS, 2020, 8 (08): : 217463 - 217472
  • [47] Spanning Edge Centrality: Large-scale Computation and Applications
    Mavroforakis, Charalampos
    Garcia-Lebron, Richard
    Koutis, Ioannis
    Terzi, Evimaria
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 732 - 742
  • [48] Machine Learning Based Localization in Large-Scale Wireless Sensor Networks
    Bhatti, Ghulam
    SENSORS, 2018, 18 (12)
  • [49] NetDP: In-Network Differential Privacy for Large-Scale Data Processing
    Zhou, Zhengyan
    Chen, Hanze
    Chen, Lingfei
    Zhang, Dong
    Wu, Chunming
    Liu, Xuan
    Khan, Muhammad Khurram
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2024, 8 (03): : 1076 - 1089
  • [50] Large-Scale Secure XGB for Vertical Federated Learning
    Fang, Wenjing
    Zhao, Derun
    Tan, Jin
    Chen, Chaochao
    Yu, Chaofan
    Wang, Li
    Wang, Lei
    Zhou, Jun
    Zhang, Benyu
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 443 - 452