ARADA: Adaptive Resource Allocation for Improving Energy Efficiency in Deep Learning Accelerators

Cited by: 0
Authors
Azhar, Muhammad Waqar [1 ]
Zouzoula, Stavroula [1 ]
Trancoso, Pedro [1 ]
Affiliations
[1] Chalmers Univ Technol, Gothenburg, Sweden
Source
PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2023, CF 2023 | 2023
Keywords
CNNs; Energy Efficiency; Resource Allocation; Accelerators;
DOI
10.1145/3587135.3592207
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Deep Learning (DL) applications are entering every part of our life given their ability to solve complex problems. Nevertheless, energy efficiency is still a major concern due to the large computational and memory requirements. State-of-the-art accelerators strive to address this issue by optimizing the architecture to the compute requirements of DL algorithms. However, there is always a mismatch between compute and memory requirements and what is offered by a particular design. A way to close this gap is by providing run-time adaptation or resource allocation to improve efficiency. This paper proposes an adaptive resource allocation for deep learning applications (ARADA) with the goal of improving energy efficiency for deep learning accelerators. This is leveraged by having a layer-by-layer resource allocation. The rationale is that each layer in the DL model has a unique compute and memory bandwidth requirement and allocating fixed resources to all layers leads to inefficiencies. This can be achieved by means of resource allocation (e.g., voltage-frequency, memory bandwidth) to save energy without sacrificing performance. Experimental results show that applying ARADA to the execution of 9 state-of-the-art CNN models results in energy savings of 38% on average compared to race-to-idle for an Edge TPU coupled with LPDDR4 off-chip memory.
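The layer-by-layer idea in the abstract can be illustrated with a toy model. The sketch below is not the ARADA algorithm; it assumes a simple roofline-style timing model and a dynamic-power model of the form C·V²·f, and every name and number in it (`Layer`, `OPERATING_POINTS`, `OPS_PER_CYCLE`, `MEM_BANDWIDTH`, `CAPACITANCE`) is hypothetical. For each layer it picks the lowest-energy voltage-frequency point that does not make the layer slower than running at the maximum frequency.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Layer:
    name: str
    ops: float          # total arithmetic operations in the layer
    bytes_moved: float  # off-chip memory traffic in bytes


# Hypothetical accelerator parameters, chosen only for illustration.
OPERATING_POINTS = [(250e6, 0.6), (500e6, 0.8), (1000e6, 1.0)]  # (Hz, V)
OPS_PER_CYCLE = 1024.0   # operations completed per clock cycle
MEM_BANDWIDTH = 8e9      # off-chip bandwidth in bytes per second
CAPACITANCE = 1e-9       # effective switched capacitance in farads


def layer_time(layer: Layer, freq: float) -> float:
    """Roofline-style time: the slower of compute and memory transfer."""
    compute = layer.ops / (OPS_PER_CYCLE * freq)
    memory = layer.bytes_moved / MEM_BANDWIDTH
    return max(compute, memory)


def layer_energy(layer: Layer, freq: float, volt: float) -> float:
    """Dynamic energy only: P = C * V^2 * f, integrated over the layer time."""
    return CAPACITANCE * volt ** 2 * freq * layer_time(layer, freq)


def pick_operating_point(layer: Layer) -> tuple[float, float]:
    """Lowest-energy point that is no slower than the maximum frequency."""
    f_max, _ = max(OPERATING_POINTS)
    deadline = layer_time(layer, f_max)
    feasible = [
        (f, v) for f, v in OPERATING_POINTS
        if layer_time(layer, f) <= deadline * (1 + 1e-9)
    ]
    return min(feasible, key=lambda p: layer_energy(layer, *p))


def total_energy(layers: list[Layer], adaptive: bool) -> float:
    """Sum layer energies with per-layer points or a fixed maximum point."""
    fixed = max(OPERATING_POINTS)
    return sum(
        layer_energy(l, *(pick_operating_point(l) if adaptive else fixed))
        for l in layers
    )


if __name__ == "__main__":
    model = [
        Layer("memory_bound", ops=1e9, bytes_moved=64e6),
        Layer("compute_bound", ops=1e10, bytes_moved=1e6),
    ]
    for layer in model:
        print(layer.name, pick_operating_point(layer))
    print("fixed:", total_energy(model, adaptive=False))
    print("adaptive:", total_energy(model, adaptive=True))
```

In this toy model a memory-bound layer can drop to a lower voltage-frequency point at no cost in time, while a compute-bound layer stays at the maximum point. The fixed baseline here is a simplification of race-to-idle: it ignores idle and static power, as well as memory-side energy, all of which a real evaluation would need to account for.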
Pages: 63 - 72
Page count: 10
Related papers
50 in total
  • [41] Resource Allocation for Phantom Cellular Networks: Energy Efficiency vs Spectral Efficiency
    Abdelhady, Amr M.
    Amin, Osama
    Alouini, Mohamed-Slim
    2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016, : 874 - 879
  • [42] A resource allocation and cell association algorithm for energy efficiency in HetNets
    Zhu, Wenxiang
    Xu, Pingping
    Jiang, Huilin
    He, Ying
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2019, 32 (16)
  • [43] Resource Allocation for Green Cognitive Radios: Energy Efficiency Maximization
    Yang, Zhou
    Jiang, Wenqian
    Li, Gang
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2018,
  • [44] Resource Allocation for Energy Efficiency in OFDMA-Enabled WPCN
    Nguyen, Tien-Tung
    Pham, Quoc-Viet
    Nguyen, Van-Dinh
    Lee, Jong-Ho
    Kim, Yong-Hwa
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2020, 9 (12) : 2049 - 2053
  • [45] Improving energy efficiency in WSN through adaptive memetic-based clustering and routing for resource management
    Vimalarani, C.
    Selvi, C. P. Thamil
    Gopinathan, B.
    Kalavani, T.
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2025, 45
  • [46] Deep Reinforcement Learning for Online Resource Allocation in Network Slicing
    Cai, Yue
    Cheng, Peng
    Chen, Zhuo
    Ding, Ming
    Vucetic, Branka
    Li, Yonghui
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (06) : 7099 - 7116
  • [47] Deep learning double auction algorithm for resource optimal allocation
    Zheng Y.-C.
    Li Z.-N.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (10) : 1863 - 1872
  • [48] RADDPG: Resource Allocation in Cognitive Radio with Deep Reinforcement Learning
    Mishra, Nikita
    Srivastava, Sumit
    Sharan, Shivendra Nath
    2021 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2021, : 589 - 595
  • [49] ReCARL: Resource Allocation in Cloud RANs With Deep Reinforcement Learning
    Xu, Zhiyuan
    Tang, Jian
    Yin, Chengxiang
    Wang, Yanzhi
    Xue, Guoliang
    Wang, Jing
    Gursoy, M. Cenk
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (07) : 2533 - 2545
  • [50] Deep Reinforcement Learning Based Resource Allocation for Heterogeneous Networks
    Yang, Helin
    Zhao, Jun
    Lam, Kwok-Yan
    Garg, Sahil
    Wu, Qingqing
    Xiong, Zehui
    2021 17TH INTERNATIONAL CONFERENCE ON WIRELESS AND MOBILE COMPUTING, NETWORKING AND COMMUNICATIONS (WIMOB 2021), 2021, : 253 - 258