Thermal-Aware Scheduling for Deep Learning on Mobile Devices With NPU

被引:4
|
作者
Tan, Tianxiang [1 ]
Cao, Guohong [1 ]
机构
[1] Penn State Univ, Sch Elect Engn & Comp Sci, University Pk, PA 16802 USA
基金
美国国家科学基金会;
关键词
Graphics processing units; Mobile handsets; Scheduling; Clocks; Performance evaluation; Temperature sensors; Mobile computing; Deep learning; mobile computing; power management; RESOURCE-MANAGEMENT;
D O I
10.1109/TMC.2024.3379501
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As Deep Neural Networks (DNNs) have been successfully applied to various fields, there is a tremendous demand for running DNNs on mobile devices. Although mobile GPU can be leveraged to improve performance, it consumes a large amount of energy. After a short period of time, the mobile device may become overheated and the processors are forced to reduce the clock speed, significantly reducing the processing speed. A different approach to support DNNs on mobile device is to leverage the Neural Processing Units (NPUs). Compared to GPU, NPU is much faster and more energy efficient, but with lower accuracy due to the use of low precision floating-point numbers. We propose to combine these two approaches to improve the performance of running DNNs on mobile devices by studying the thermal-aware scheduling problem, where the goal is to achieve a better tradeoff between processing time and accuracy while ensuring that the mobile device is not overheated. To solve the problem, we propose a heuristic-based scheduling algorithm to determine when to run DNNs on GPU and when to run DNNs on NPU based on the current states of the mobile device. The heuristic-based algorithm makes scheduling decisions greedily and ignores their future impacts. Thus, we propose a deep reinforcement learning based scheduling algorithm to further improve performance. Extensive evaluation results show that the proposed algorithms can significantly improve the performance of running DNNs on mobile devices while avoiding overheating.
引用
收藏
页码:10706 / 10719
页数:14
相关论文
共 50 条
  • [21] Optimized Thermal-Aware Job Scheduling and Control of Data Centers
    Van Damme, Tobias
    De Persis, Claudio
    Tesi, Pietro
    IFAC PAPERSONLINE, 2017, 50 (01): : 8244 - 8249
  • [22] Thermal-aware Device Design of Nano-scale Devices
    Uchida, Ken
    Takahashi, Tsunaki
    2014 12TH IEEE INTERNATIONAL CONFERENCE ON SOLID-STATE AND INTEGRATED CIRCUIT TECHNOLOGY (ICSICT), 2014,
  • [23] Adaptive thermal-aware task scheduling for multi-core systems
    Chu, Hsin-Hao
    Kao, Yu-Chon
    Chen, Ya-Shu
    JOURNAL OF SYSTEMS AND SOFTWARE, 2015, 99 : 155 - 174
  • [24] Thermal-Aware SoC Test Scheduling with Test Set Partitioning and Interleaving
    Zhiyuan He
    Zebo Peng
    Petru Eles
    Paul Rosinger
    Bashir M. Al-Hashimi
    Journal of Electronic Testing, 2008, 24 : 247 - 257
  • [25] Thermal-Aware Scheduling for MPSoC in the Avionics Domain: Tooling and Initial Results
    Benedikt, Ondrej
    Sojka, Michal
    Zaykov, Pavel
    Hornof, David
    Kafka, Matej
    Sucha, Premysl
    Hanzalek, Zdenek
    2021 IEEE 27TH INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA 2021), 2021, : 159 - 168
  • [26] Thermal-Aware Task Scheduling for 3D Multicore Processors
    Zhou, Xiuyi
    Yang, Jun
    Xu, Yi
    Zhang, Youtao
    Zhao, Jianhua
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2010, 21 (01) : 60 - 71
  • [27] Thermal-Aware Scheduling of Batch Jobs in Geographically Distributed Data Centers
    Polverini, Marco
    Cianfrani, Antonio
    Ren, Shaolei
    Vasilakos, Athanasios V.
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2014, 2 (01) : 71 - 84
  • [28] Thermal-aware application scheduling on device-heterogeneous embedded architectures
    Swaminathan, Karthik
    Kotra, Jagadish
    Liu, Huichu
    Sampson, Jack
    Kandemir, Mahmut
    Narayanan, Vijaykrishnan
    2015 28TH INTERNATIONAL CONFERENCE ON VLSI DESIGN (VLSID), 2015, : 221 - 226
  • [29] Thermal-Aware Job Allocation and Scheduling for Three Dimensional Chip Multiprocessor
    Liu, Shaobo
    Zhang, Jingyi
    Wu, Qing
    Qiu, Qinru
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2010), 2010, : 390 - 398
  • [30] A Fast Scheme to Investigate Thermal-Aware Scheduling Policy for Multicore Processors
    He, Liqiang
    Narisu, Cha
    ADVANCED PARALLEL PROCESSING TECHNOLOGIES, PROCEEDINGS, 2009, 5737 : 1 - 10