Thermal-Aware Scheduling for Deep Learning on Mobile Devices With NPU

被引:4
|
作者
Tan, Tianxiang [1 ]
Cao, Guohong [1 ]
机构
[1] Penn State Univ, Sch Elect Engn & Comp Sci, University Pk, PA 16802 USA
基金
美国国家科学基金会;
关键词
Graphics processing units; Mobile handsets; Scheduling; Clocks; Performance evaluation; Temperature sensors; Mobile computing; Deep learning; mobile computing; power management; RESOURCE-MANAGEMENT;
D O I
10.1109/TMC.2024.3379501
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As Deep Neural Networks (DNNs) have been successfully applied to various fields, there is a tremendous demand for running DNNs on mobile devices. Although mobile GPU can be leveraged to improve performance, it consumes a large amount of energy. After a short period of time, the mobile device may become overheated and the processors are forced to reduce the clock speed, significantly reducing the processing speed. A different approach to support DNNs on mobile device is to leverage the Neural Processing Units (NPUs). Compared to GPU, NPU is much faster and more energy efficient, but with lower accuracy due to the use of low precision floating-point numbers. We propose to combine these two approaches to improve the performance of running DNNs on mobile devices by studying the thermal-aware scheduling problem, where the goal is to achieve a better tradeoff between processing time and accuracy while ensuring that the mobile device is not overheated. To solve the problem, we propose a heuristic-based scheduling algorithm to determine when to run DNNs on GPU and when to run DNNs on NPU based on the current states of the mobile device. The heuristic-based algorithm makes scheduling decisions greedily and ignores their future impacts. Thus, we propose a deep reinforcement learning based scheduling algorithm to further improve performance. Extensive evaluation results show that the proposed algorithms can significantly improve the performance of running DNNs on mobile devices while avoiding overheating.
引用
收藏
页码:10706 / 10719
页数:14
相关论文
共 50 条
  • [1] Efficient NPU–GPU scheduling for real-time deep learning inference on mobile devices
    Chengwu Yu
    Meng Wang
    Shan Chen
    Wanqi Wang
    Weiwei Fang
    Yanming Chen
    Neal N.Xiong
    Journal of Real-Time Image Processing, 2025, 22 (2)
  • [2] A thermal-aware scheduling for multicore architectures
    Chien, Ting-Hsuan
    Chang, Rong-Guey
    JOURNAL OF SYSTEMS ARCHITECTURE, 2016, 62 : 54 - 62
  • [3] Thermal-Aware Scheduling in Green Data
    Chaudhry, Muhammad Tayyab
    Ling, Teck Chaw
    Manzoor, Atif
    Hussain, Syed Asad
    Kim, Jongwon
    ACM COMPUTING SURVEYS, 2015, 47 (03)
  • [4] Thermal-Aware Scheduling for Future Chip Multiprocessors
    Stavrou, Kyriakos
    Trancoso, Pedro
    EURASIP JOURNAL ON EMBEDDED SYSTEMS, 2007, (01)
  • [5] Thermal-Aware Sensor Scheduling for Distributed Estimation
    Forte, Domenic
    Srivastava, Ankur
    DISTRIBUTED COMPUTING IN SENSOR SYSTEMS, PROCEEDINGS, 2010, 6131 : 116 - 129
  • [6] Thermal-Aware Scheduling Collaborating with OS and Architecture
    Lee, Cheng-Yu
    Yang, Shuang-Jhu
    Chang, Rong-Guey
    2013 42ND ANNUAL INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2013, : 1044 - 1051
  • [7] Thermal-Aware Sensor Scheduling for Distributed Estimation
    Forte, Domenic
    Srivastava, Ankur
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2013, 9 (04)
  • [8] Regulating CPU temperature with thermal-aware scheduling using a reduced order learning thermal model
    Dowling, Anthony
    Jiang, Lin
    Cheng, Ming-Cheng
    Liu, Yu
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 166
  • [9] Server temperature prediction using deep neural networks to assist thermal-aware scheduling
    Akbar, Saeed
    Li, Ruixuan
    Waqas, Muhammad
    Jan, Avais
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2022, 36
  • [10] Thermal-aware Task Scheduling at the System Software Level
    Choi, Jeonghwan
    Cher, Chen-Yong
    Franke, Hubertus
    Hamann, Hendrik
    Weger, Alan
    Bose, Pradip
    ISLPED'07: PROCEEDINGS OF THE 2007 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2007, : 213 - 218