Resource-aware in-edge distributed real-time deep learning

被引:0
|
作者
Yoosefi, Amin [1 ]
Kargahi, Mehdi [1 ,2 ]
机构
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran, Iran
关键词
Distributed deep learning; Edge computing; Real-time embedded systems; Resource constraints;
D O I
10.1016/j.iot.2024.101263
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks (DNNs) are widely used in IoT devices for applications like pattern recognition. However, slight variations in the input data may cause considerable accuracy loss, while capturing all data variations to provide a rich training dataset is almost unrealistic. Online learning can assist by offering to continue adapting the model to the data variations even during inference, however at the expense of higher resource demands, namely a challenging requirement for resource-constrained IoT devices. Furthermore, training on a data sample must be concluded in a timely manner, to have the model updated for subsequent data inferences, compelling the data inter-arrival time as a time constraint. Distributed learning can mitigate the per-device resource demand by splitting the model and placing the partitions on the IoT devices. However, the previous distributed learning studies primarily aim to improve the throughput (through accelerating the training by large-scale CPU or GPU clusters), with less attention to the timeliness constraints. This paper, however, pays attention to some application-specific constraints of timeliness and accuracy under IoT device resource limitations using modular neural networks (MNNs). The MNN clusters the input space using a proposed online approach, where a module is specialized to each of the dynamic data clusters to perform inference. The MNN adjusts its computational complexity adaptively by adding, removing, and tuning the module clusters as new data arrives. The simulation results show that the proposed method effectively adheres to the application constraints and the device resource limitations.
引用
收藏
页数:31
相关论文
共 50 条
  • [1] Resource-Aware Parameter Tuning for Real-Time Applications
    Gabriel, Dirk
    Stechele, Walter
    Wildermann, Stefan
    [J]. ARCHITECTURE OF COMPUTING SYSTEMS - ARCS 2019, 2019, 11479 : 45 - 55
  • [2] Adaptive Real-Time Clustering Algorithm with Resource-Aware
    Wang, Xiaoni
    [J]. LISS 2014, 2015, : 1635 - 1639
  • [3] A shared resource-aware real-time task allocation algorithm
    Yang, Mao-Lin
    Lei, Hang
    Liao, Yong
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2014, 37 (07): : 1455 - 1465
  • [4] Resource-Aware Partitioned Scheduling for Heterogeneous Multicore Real-Time Systems
    Han, Jian-Jun
    Cai, Wen
    Zhu, Dakai
    [J]. 2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [5] FPGA Resource-aware Structured Pruning for Real-Time Neural Networks
    Ramhorst, Benjamin
    Loncar, Vladimir
    Constantinides, George A.
    [J]. 2023 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY, ICFPT, 2023, : 282 - 283
  • [6] Resource-Aware Split Federated Learning for Edge Intelligence
    Arouj, Amna
    Abdelmoniem, Ahmed M.
    Alhilal, Ahmad
    You, Linlin
    Wang, Chen
    [J]. PROCEEDINGS 2024 IEEE 3RD WORKSHOP ON MACHINE LEARNING ON EDGE IN SENSOR SYSTEMS, SENSYS-ML 2024, 2024, : 15 - 20
  • [7] DISTREAL: Distributed Resource-Aware Learning in Heterogeneous Systems
    Rapp, Martin
    Khalili, Ramin
    Pfeiffer, Kilian
    Henkel, Joerg
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8062 - 8071
  • [8] RASM: Resource-Aware Service Migration in Edge Computing based on Deep Reinforcement Learning
    Mwasinga, Lusungu Josh
    Le, Duc-Tai
    Raza, Syed M.
    Challa, Rajesh
    Kim, Moonseong
    Choo, Hyunseung
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 182
  • [9] CoEdge: A Cooperative Edge System for Distributed Real-Time Deep Learning Tasks
    Jiang, Zhehao
    Ling, Neiwen
    Huang, Xuan
    Shi, Shuyao
    Wu, Chenhao
    Zhao, Xiaoguang
    Yan, Zhenyu
    Xing, Guoliang
    [J]. PROCEEDINGS OF THE 2023 THE 22ND INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS, IPSN 2023, 2023, : 53 - 66
  • [10] Time-Sensitive and Resource-Aware Concurrent Workflow Scheduling for Edge Computing Platforms Based on Deep Reinforcement Learning
    Zhang, Jiaming
    Wang, Tao
    Cheng, Lianglun
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (19):