Design and Performance Analysis of Partial Computation Output Schemes for Accelerating Coded Machine Learning

被引:1
|
作者
Xu, Xinping [1 ,2 ]
Lin, Xiaojun [3 ]
Duan, Lingjie [4 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Berkeley Educ Alliance Res Singapore, Singapore 138602, Singapore
[3] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[4] Singapore Univ Technol & Design, Engn Syst & Design Pillar, Singapore 487372, Singapore
基金
美国国家科学基金会;
关键词
Runtime; Task analysis; Codes; Machine learning; Encoding; Sparse matrices; Servers; Coded machine learning; maximum-distance-separable codes; partial computation outputs; performance bound analysis;
D O I
10.1109/TNSE.2022.3228322
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Coded machine learning is a technique to use codes, such as (n, q)-maximum-distance-separable ((n, q)-MDS) codes, to reduce the negative effect of stragglers by requiring q out of n workers to complete their computation. However, the MDS scheme incurs significant inefficiency in wasting stragglers' unfinished computation and keeping faster workers idle. Accordingly, this paper proposes to fragment each worker's load into small pieces and utilizes all workers' partial computation outputs (PCO) to reduce the overall runtime. While easy-to-implement, the theoretical runtime performance analysis of our PCO scheme is challenging. We present new bounds and asymptotic analysis to prove that our PCO scheme always reduces the overall runtime for any random distribution of workers' speeds, and its performance gain over the MDS scheme can be arbitrarily large under high variability of workers' speeds. Moreover, our analysis shows another advantage: the PCO scheme's performance is robust and insensitive to system parameter variations, while the MDS scheme has to know workers' speeds for carefully optimizing q. Finally, our realistic experiments validate that the PCO scheme reduces the overall runtime from that of the MDS scheme by at least 12.3%, and we implement our PCO scheme for solving a typical machine learning problem of linear regression.
引用
收藏
页码:1119 / 1130
页数:12
相关论文
共 50 条
  • [21] High-Throughput Computation and Machine Learning Prediction Accelerating the Design of Cathode Catalysts for Li-CO2 Batteries
    Yao, Tengyu
    Xu, Zhenming
    Hu, Tingsong
    Hu, Kang
    Cui, Xueliang
    Shen, Laifa
    JOURNAL OF PHYSICAL CHEMISTRY C, 2024, 128 (28): : 11534 - 11542
  • [22] Performance Analysis of Coded OFDM System Using Various Coding Schemes
    Dwivedi, Vivek K.
    Gupta, Abhinav
    Kumar, Richansh
    Singh, G.
    PIERS 2009 MOSCOW VOLS I AND II, PROCEEDINGS, 2009, : 1249 - +
  • [23] Performance analysis of trellis coded beamforming schemes for MIMO fading channels
    Chu, L
    Yuan, J
    2004 9TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS), 2004, : 270 - 274
  • [24] Machine Learning design of Volume of Fluid schemes for compressible flows
    Despres, Bruno
    Jourdren, Herve
    JOURNAL OF COMPUTATIONAL PHYSICS, 2020, 408
  • [25] Machine learning and computation-enabled intelligent sensor design
    Ballard, Zachary
    Brown, Calvin
    Madni, Asad M.
    Ozcan, Aydogan
    NATURE MACHINE INTELLIGENCE, 2021, 3 (07) : 556 - 565
  • [26] Machine learning and computation-enabled intelligent sensor design
    Zachary Ballard
    Calvin Brown
    Asad M. Madni
    Aydogan Ozcan
    Nature Machine Intelligence, 2021, 3 : 556 - 565
  • [27] Automatic design of machine learning via evolutionary computation: A survey
    Li, Nan
    Ma, Lianbo
    Xing, Tiejun
    Yu, Guo
    Wang, Chen
    Wen, Yingyou
    Cheng, Shi
    Gao, Shangce
    APPLIED SOFT COMPUTING, 2023, 143
  • [28] Comparative Analysis of Routing Schemes Based on Machine Learning
    Yang, Shaoyu
    Tan, Cong
    Madsen, Dag Oivind
    Xiang, Haige
    Li, Yun
    Khan, Imran
    Choi, Bong Jun
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [29] The Performance Evaluation to a Smart Robots Embedded with Machine Learning Schemes
    Chen, Joy Iong-Zong
    Hengjinda, Pisith
    Hsu, Shu Rui
    2020 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2020), 2021, : 408 - 411
  • [30] Learning Everywhere: Pervasive Machine Learning for Effective High-Performance Computation
    Fox, Geoffrey
    Glazier, James A.
    Kadupitiya, J. C. S.
    Jadhao, Vikram
    Kim, Minje
    Qiu, Judy
    Sluka, James P.
    Somogyi, Endre
    Marathe, Madhav
    Adiga, Abhijin
    Chen, Jiangzhuo
    Beckstein, Oliver
    Jha, Shantenu
    2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 422 - 429