Thermal-Aware Scheduling for Deep Learning on Mobile Devices With NPU

被引:4
|
作者
Tan, Tianxiang [1 ]
Cao, Guohong [1 ]
机构
[1] Penn State Univ, Sch Elect Engn & Comp Sci, University Pk, PA 16802 USA
基金
美国国家科学基金会;
关键词
Graphics processing units; Mobile handsets; Scheduling; Clocks; Performance evaluation; Temperature sensors; Mobile computing; Deep learning; mobile computing; power management; RESOURCE-MANAGEMENT;
D O I
10.1109/TMC.2024.3379501
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As Deep Neural Networks (DNNs) have been successfully applied to various fields, there is a tremendous demand for running DNNs on mobile devices. Although mobile GPU can be leveraged to improve performance, it consumes a large amount of energy. After a short period of time, the mobile device may become overheated and the processors are forced to reduce the clock speed, significantly reducing the processing speed. A different approach to support DNNs on mobile device is to leverage the Neural Processing Units (NPUs). Compared to GPU, NPU is much faster and more energy efficient, but with lower accuracy due to the use of low precision floating-point numbers. We propose to combine these two approaches to improve the performance of running DNNs on mobile devices by studying the thermal-aware scheduling problem, where the goal is to achieve a better tradeoff between processing time and accuracy while ensuring that the mobile device is not overheated. To solve the problem, we propose a heuristic-based scheduling algorithm to determine when to run DNNs on GPU and when to run DNNs on NPU based on the current states of the mobile device. The heuristic-based algorithm makes scheduling decisions greedily and ignores their future impacts. Thus, we propose a deep reinforcement learning based scheduling algorithm to further improve performance. Extensive evaluation results show that the proposed algorithms can significantly improve the performance of running DNNs on mobile devices while avoiding overheating.
引用
收藏
页码:10706 / 10719
页数:14
相关论文
共 50 条
  • [31] Thermal-aware SoC test scheduling with test set partitioning and interleaving
    He, Zhiyuan
    Peng, Zebo
    Eles, Petru
    Rosinger, Paul
    Al-Hashimi, Bashir M.
    21ST IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT-TOLERANCE IN VLSI SYSTEMS, PROCEEDINGS, 2006, : 477 - +
  • [32] Thermal-aware SoC test scheduling with test set partitioning and interleaving
    He, Zhiyuan
    Peng, Zebo
    Eles, Petru
    Rosinger, Paul
    Al-Hashimi, Bashir M.
    JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 2008, 24 (1-3): : 247 - 257
  • [33] Thermal-aware Job Scheduling of MapReduce Applications on High Performance Clusters
    Taneja, Shubbhi
    Zhou, Yi
    Alghamdi, Mohammed I.
    Qin, Xiao
    2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPPW), 2017, : 261 - 270
  • [34] Thermal-aware task scheduling for hot-spots avoidance in cloud
    Wang, Jiru
    Wang, Jihe
    Guo, Bing
    Shen, Yan
    Journal of Computational Information Systems, 2015, 11 (10): : 3665 - 3673
  • [35] Thermal-Aware Global Real-Time Scheduling on Multicore Systems
    Fisher, Nathan
    Chen, Jian-Jia
    Wang, Shengquan
    Thiele, Lothar
    15TH IEEE REAL-TIME AND EMBEDDED TECHNOLOGY AND APPLICATION SYMPOSIUM: RTAS 2009, PROCEEDINGS, 2009, : 131 - +
  • [36] <bold>Thermal-Aware Scheduling: A solution for Future Chip Multiprocessors Thermal Problems</bold>
    Stavrou, Kyrlakos
    Trancoso, Pedro
    DSD 2006: 9TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN: ARCHITECTURES, METHODS AND TOOLS, PROCEEDINGS, 2006, : 123 - +
  • [37] Thermal-aware Body Bias Modulation for High Performance Mobile Core
    Oh, Chungki
    Kim, Hyung-Ock
    Seomun, Jun
    Kim, Wook
    Jeon, Jaehan
    Do, Kyung-Tae
    Won, Hyo-Sig
    Kim, Kee Sup
    2012 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2012, : 147 - 150
  • [38] Thermal-Aware Overclocking for Smartphones
    Srinivasa, Guru Prasad
    Werner, David
    Hempstead, Mark
    Challen, Geoffrey
    2021 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2021), 2021, : 220 - 222
  • [39] A thermal-aware superscalar microprocessor
    Lim, CH
    Daasch, WR
    Cai, G
    PROCEEDING OF THE 2002 3RD INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, 2002, : 517 - 522
  • [40] Thermal-aware clustered microarchitectures
    Chaparro, P
    González, J
    González, A
    IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN: VLSI IN COMPUTERS & PROCESSORS, PROCEEDINGS, 2004, : 48 - 53