Coordinated DVFS and Precision Control for Deep Neural Networks

Cited by: 18
Authors
Nabavinejad, Seyed Morteza [1]
Hafez-Kolahi, Hassan [2]
Reda, Sherief [3]
Affiliations
[1] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran, Iran
[2] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
[3] Brown Univ, Sch Engn, Providence, RI 02912 USA
Funding
US National Science Foundation;
Keywords
Graphics processing units; Power demand; Time factors; Runtime; Time-frequency analysis; Servers; Neural networks; Deep neural network; hardware accelerator; power; accuracy; response time;
DOI
10.1109/LCA.2019.2942020
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Traditionally, DVFS has been the main mechanism for trading off performance and power. We observe that Deep Neural Network (DNN) applications offer the possibility of trading off performance, power, and accuracy using both DVFS and numerical precision levels. Our proposed approach, Power-Inference accuracy Trading (PIT), monitors the server's load and adjusts the precision of the DNN model and the DVFS setting of the GPU accordingly, trading off accuracy and power consumption against response time. At high loads with tight request arrivals, PIT leverages the GPU's INT8-precision instructions to dynamically lower the precision of deployed DNN models and boosts the GPU frequency, executing requests faster at the expense of reduced accuracy and higher power consumption. When the request arrival rate is relaxed and requests have slack time, PIT instead deploys the high-precision versions of the models to improve accuracy and reduces the GPU frequency to decrease power consumption. We implement and deploy PIT on a state-of-the-art server equipped with a Tesla P40 GPU. Experimental results demonstrate that, depending on the load, PIT can improve response time by up to 11 percent compared to a job scheduler that uses only FP32 precision. It also reduces energy consumption by up to 28 percent, while achieving around 99.5 percent of the accuracy of FP32-only execution.
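The load-dependent policy described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Setting` type, the `choose_setting` function, the slack-based decision rule, and both frequency constants are hypothetical and chosen only to show the shape of the decision (INT8 plus high frequency under tight arrivals, FP32 plus low frequency when there is slack).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Setting:
    precision: str     # "INT8" or "FP32"
    gpu_freq_mhz: int  # target GPU core clock


# Hypothetical clock values for illustration only; not taken from the paper.
HIGH_FREQ_MHZ = 1500
LOW_FREQ_MHZ = 800


def choose_setting(slack_ms: float, fp32_latency_ms: float) -> Setting:
    """Pick precision and frequency from the slack available per request.

    slack_ms:        time available before the next request must finish
    fp32_latency_ms: assumed latency of the FP32 model at the low frequency
    """
    if slack_ms < fp32_latency_ms:
        # Tight arrivals: favor response time over accuracy and power.
        return Setting("INT8", HIGH_FREQ_MHZ)
    # Relaxed arrivals: favor accuracy and lower power consumption.
    return Setting("FP32", LOW_FREQ_MHZ)
```

For example, `choose_setting(5.0, 10.0)` selects INT8 at the high clock, while `choose_setting(20.0, 10.0)` selects FP32 at the low clock. A real controller would also need to apply the chosen clock to the GPU and swap the deployed model, which this sketch leaves out.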
Pages: 136-140
Number of pages: 5