Evaluating Performance, Power and Energy of Deep Neural Networks on CPUs and GPUs

被引:3
|
作者
Sun, Yuyang [1 ]
Ou, Zhixin [1 ]
Chen, Juan [1 ]
Qi, Xinxin [1 ]
Guo, Yifei [1 ]
Cai, Shunzhe [1 ]
Yan, Xiaoming [1 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China
来源
关键词
Convolutional neural network; Deep learning; Performance characterization; Power characterization; Energy characterization;
D O I
10.1007/978-981-16-7443-3_12
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Deep learning has achieved accuracy and fast training speed and has been successfully applied to many fields, including speech recognition, text processing, image processing and video processing. However, the cost of high power and energy comes together with the high accuracy and training speed of Deep Neural Network (DNN). This inspires researchers to perform characterization in terms of performance, power and energy for guiding the architecture design of DNN models. There are three critical issues to solve for designing a both accurate and energy-efficient DNN model: i) how the software parameters affect the DNN models; ii) how the hardware parameters affect the DNN models; and iii) how to choose the best energy-efficient DNN model. To answer the three issues above, we capture and analyze the performance, power and energy behaviors for multiple experiment settings. We evaluate four DNN models (i.e., LeNet, GoogLeNet, AlexNet, and CaffeNet) with various parameter settings (both hardware and software) on both CPU and GPU platforms. Evaluation results provide detailed DNN characterization and some key insights to facilitate the design of energy-efficient deep learning solutions.
引用
收藏
页码:196 / 221
页数:26
相关论文
共 50 条
  • [1] Evaluating the Energy Efficiency of Deep Convolutional Neural Networks on CPUs and GPUs
    Li, Da
    Chen, Xinbo
    Becchi, Michela
    Zong, Ziliang
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 477 - 484
  • [2] Evaluating Gather and Scatter Performance on CPUs and GPUs
    Lavin, Patrick
    Young, Jeffrey
    Vuduc, Richard
    Riedy, Jason
    Vose, Aaron
    Ernst, Daniel
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, MEMSYS 2020, 2020, : 209 - 222
  • [3] Performance of Training Sparse Deep Neural Networks on GPUs
    Wang, Jianzong
    Huang, Zhangcheng
    Kong, Lingwei
    Xiao, Jing
    Wang, Pengyu
    Zhang, Lu
    Li, Chao
    [J]. 2019 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2019,
  • [4] On Energy Nonproportionality of CPUs and GPUs
    Manumachu, Ravi Reddy
    Lastovetsky, Alexey
    [J]. 2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 34 - 44
  • [5] CGYRO Performance on Power9 CPUs and Volta GPUs
    Sfiligoi, I
    Candy, J.
    Kostuk, M.
    [J]. HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2018, 2018, 11203 : 365 - 372
  • [6] Evaluating the Performance and Energy Efficiency of N-Body Codes on Multi-Core CPUs and GPUs
    Zecena, Ivan
    Burtscher, Martin
    Jin, Tongdan
    Zong, Ziliang
    [J]. 2013 IEEE 32ND INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2013,
  • [7] PAKCK: Performance and Power Analysis of Key Computational Kernels on CPUs and GPUs
    Mullen, Julia S.
    Wolf, Michael M.
    Klein, Anna
    [J]. 2013 IEEE CONFERENCE ON HIGH PERFORMANCE EXTREME COMPUTING (HPEC), 2013,
  • [8] Evaluating the Impact of Mixed-Precision on Fault Propagation for Deep Neural Networks on GPUs
    Dos Santos, Fernando Fernandes
    Rech, Paolo
    Kritikakou, Angeliki
    Sentieys, Olivier
    [J]. 2022 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2022), 2022, : 327 - 327
  • [9] Evaluating LSTM Time Series Prediction Performance on Benchmark CPUs and GPUs in Cloud Environments
    Saha, Aditi
    Rahman, Mohammad
    Wu, Fan
    [J]. PROCEEDINGS OF THE 2024 ACM SOUTHEAST CONFERENCE, ACMSE 2024, 2024, : 321 - 322
  • [10] Detailed Characterization of Deep Neural Networks on GPUs and FPGAs
    Karki, Aajna
    Keshava, Chethan Palangotu
    Shivakumar, Spoorthi Mysore
    Skow, Joshua
    Hegde, Goutam Madhukeshwar
    Jeon, Hyeran
    [J]. 12TH WORKSHOP ON GENERAL PURPOSE PROCESSING USING GPUS (GPGPU 12), 2019, : 12 - 21