Thermal-Aware Scheduling for Deep Learning on Mobile Devices With NPU

被引:4
|
作者
Tan, Tianxiang [1 ]
Cao, Guohong [1 ]
机构
[1] Penn State Univ, Sch Elect Engn & Comp Sci, University Pk, PA 16802 USA
基金
美国国家科学基金会;
关键词
Graphics processing units; Mobile handsets; Scheduling; Clocks; Performance evaluation; Temperature sensors; Mobile computing; Deep learning; mobile computing; power management; RESOURCE-MANAGEMENT;
D O I
10.1109/TMC.2024.3379501
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As Deep Neural Networks (DNNs) have been successfully applied to various fields, there is a tremendous demand for running DNNs on mobile devices. Although mobile GPU can be leveraged to improve performance, it consumes a large amount of energy. After a short period of time, the mobile device may become overheated and the processors are forced to reduce the clock speed, significantly reducing the processing speed. A different approach to support DNNs on mobile device is to leverage the Neural Processing Units (NPUs). Compared to GPU, NPU is much faster and more energy efficient, but with lower accuracy due to the use of low precision floating-point numbers. We propose to combine these two approaches to improve the performance of running DNNs on mobile devices by studying the thermal-aware scheduling problem, where the goal is to achieve a better tradeoff between processing time and accuracy while ensuring that the mobile device is not overheated. To solve the problem, we propose a heuristic-based scheduling algorithm to determine when to run DNNs on GPU and when to run DNNs on NPU based on the current states of the mobile device. The heuristic-based algorithm makes scheduling decisions greedily and ignores their future impacts. Thus, we propose a deep reinforcement learning based scheduling algorithm to further improve performance. Extensive evaluation results show that the proposed algorithms can significantly improve the performance of running DNNs on mobile devices while avoiding overheating.
引用
收藏
页码:10706 / 10719
页数:14
相关论文
共 50 条
  • [41] Thermal-Aware Task Scheduling for Data centers through Minimizing Heat Recirculation
    Tang, Qinghui
    Gupta, Sandeep K. S.
    Varsamopoulos, Georgios
    2007 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, 2007, : 129 - 138
  • [42] Thermal-aware Scheduling for Data Parallel Workloads on Multi-Core Processors
    Tan, Hengxing
    Ranka, Sanjay
    2014 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATION (ISCC), 2014,
  • [43] Energy-Efficient Thermal-Aware Scheduling for RT Tasks Using TCPN
    Rubio-Anguiano, L.
    Desirena-Lopez, G.
    Ramirez-Trevino, A.
    Briz, J. L.
    IFAC PAPERSONLINE, 2018, 51 (07): : 236 - 242
  • [44] Thermal-aware Test Scheduling Strategy for Network-on-Chip based Systems
    Manna, Kanchan
    Sagar, Chatla Swami
    Chattopadhyay, Santanu
    Sengupta, Indranil
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2019, 15 (01)
  • [45] State-of-the-Art Energy-Efficient Thermal-Aware Scheduling in Cloud
    Garg, Ritu
    Rani, Rama
    INFORMATION AND COMMUNICATION TECHNOLOGY FOR COMPETITIVE STRATEGIES, 2019, 40 : 157 - 164
  • [46] TaSaT: Thermal-Aware Scheduling and Tuning Algorithm for Heterogeneous and Configurable Embedded Systems
    Alsafrjalani, Mohamad Hammam
    Adegbija, Tosiron
    PROCEEDINGS OF THE 2018 GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI'18), 2018, : 75 - 80
  • [47] Thermal-aware SoC Test Scheduling with Voltage/Frequency Scaling and Test Partition
    Ying Zhang
    Li Ling
    Jianhui Jiang
    Jie Xiao
    Journal of Electronic Testing, 2018, 34 : 447 - 460
  • [48] Thermal-aware MapReduce Real-Time Scheduling in Heterogeneous Server Systems
    Tang, Tzu-Chen
    Chen, Ya-Shu
    2016 RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS, 2016, : 207 - 212
  • [49] Power and thermal-aware virtual machine scheduling optimization in cloud data center
    Chen, Rui
    Liu, Bo
    Lin, WeiWei
    Lin, JianPeng
    Cheng, HuiWen
    Li, KeQin
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 145 : 578 - 589
  • [50] Thermal-aware global real-time scheduling and analysis on multicore systems
    Fisher, Nathan
    Chen, Jian-Jia
    Wang, Shengquan
    Thiele, Lothar
    JOURNAL OF SYSTEMS ARCHITECTURE, 2011, 57 (05) : 547 - 560