ByRDiE: Byzantine-Resilient Distributed Coordinate Descent for Decentralized Learning

被引:70
|
作者
Yang, Zhixiong [1 ]
Bajwa, Waheed U. [1 ]
机构
[1] Rutgers Univ New Brunswick, Dept Elect & Comp Engn, 94 Brett Rd, Piscataway, NJ 08854 USA
基金
美国国家科学基金会;
关键词
Training; Optimization; Machine learning; Distributed algorithms; Training data; Information processing; Machine learning algorithms; Byzantine failure; consensus; coordinate descent; decentralized learning; distributed optimization; empirical risk minimization; machine learning; CONSENSUS; OPTIMIZATION; ALGORITHM; ADMM;
D O I
10.1109/TSIPN.2019.2928176
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Distributed machine learning algorithms enable learning of models from datasets that are distributed over a network without gathering the data at a centralized location. While efficient distributed algorithms have been developed under the assumption of faultless networks, failures that can render these algorithms nonfunctional occur frequently in the real world. This paper focuses on the problem of Byzantine failures, which are the hardest to safeguard against in distributed algorithms. While Byzantine fault tolerance has a rich history, existing work does not translate into efficient and practical algorithms for high-dimensional learning in fully distributed (also known as decentralized) settings. In this paper, an algorithm termed Byzantine-resilient distributed coordinate descent is developed and analyzed that enables distributed learning in the presence of Byzantine failures. Theoretical analysis (convex settings) and numerical experiments (convex and nonconvex settings) highlight its usefulness for high-dimensional distributed learning in the presence of Byzantine failures.
引用
收藏
页码:611 / 627
页数:17
相关论文
共 50 条
  • [31] Byzantine-Resilient High-Dimensional Federated Learning
    Data, Deepesh
    Diggavi, Suhas N.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (10) : 6639 - 6670
  • [32] Byzantine-Resilient Counting in Networks
    Chatterjee, Soumyottam
    Pandurangan, Gopal
    Robinson, Peter
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 12 - 22
  • [33] Byzantine-Resilient Distributed Bandit Online Optimization in Dynamic Environments
    Wei, Mengli
    Yu, Wenwu
    Liu, Hongzhe
    Chen, Duxin
    IEEE Transactions on Industrial Cyber-Physical Systems, 2024, 2 : 154 - 165
  • [34] DRACO: Byzantine-resilient Distributed Training via Redundant Gradients
    Chen, Lingjiao
    Wang, Hongyi
    Charles, Zachary
    Papailiopoulos, Dimitris
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [35] Byzantine-Resilient Distributed Optimization of Multi-Dimensional Functions
    Kuwaranancharoen, Kananart
    Xin, Lei
    Sundaram, Shreyas
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 4399 - 4404
  • [36] BYZANTINE-RESILIENT DISTRIBUTED LARGE-SCALE MATRIX COMPLETION
    Lin, Feng
    Ling, Qing
    Xiong, Zhiwei
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 8167 - 8171
  • [37] Byzantine-Resilient Multiagent Optimization
    Su, Lili
    Vaidya, Nitin H.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (05) : 2227 - 2233
  • [38] Byzantine-Tolerant Distributed Coordinate Descent
    Data, Deepesh
    Diggavi, Suhas
    2019 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2019, : 2724 - 2728
  • [39] Fully decentralized privacy-enabled Federated Learning system based on Byzantine-resilient consensus protocol
    Ferenczi, Andras
    Badica, Costin
    SIMULATION MODELLING PRACTICE AND THEORY, 2024, 136
  • [40] A recursive Byzantine-resilient protocol
    Cheng, Chien-Fu
    Tsai, Kuo-Tang
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2015, 48 : 87 - 98