Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices

Cited by: 41
Authors
Lu, Zongqing [1]
Rallapalli, Swati [2]
Chan, Kevin [3]
La Porta, Thomas [4]
Affiliations
[1] Peking Univ, Beijing, Peoples R China
[2] IBM Res, Yorktown Hts, NY USA
[3] Army Res Lab, Washington, DC USA
[4] Penn State Univ, University Pk, PA 16802 USA
Keywords
Convolutional neural networks; modeling; mobile devices
DOI
10.1145/3123266.3123389
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Convolutional Neural Networks (CNNs) have revolutionized research in computer vision because of their ability to capture complex patterns, which results in high inference accuracy. However, the increasingly complex nature of these neural networks means that they are particularly suited to server computers with powerful GPUs. We envision that deep learning applications will eventually be widely deployed on mobile devices, e.g., smartphones, self-driving cars, and drones. Therefore, in this paper, we aim to understand the resource requirements (time, memory) of CNNs on mobile devices. First, by deploying several popular CNNs on mobile CPUs and GPUs, we measure and analyze the performance and resource usage of every layer of the CNNs. Our findings point to potential ways of optimizing performance on mobile devices. Second, we model the resource requirements of the different CNN computations. Finally, based on the measurement, profiling, and modeling, we build and evaluate our modeling tool, Augur, which takes a CNN configuration (descriptor) as input and estimates the compute time and resource usage of the CNN, giving insight into whether and how efficiently a CNN can be run on a given mobile platform. In doing so, Augur tackles several challenges: (i) how to overcome profiling and measurement overhead; (ii) how to capture the variance across mobile platforms with different processors, memory, and cache sizes; and (iii) how to account for the variance in the number, type, and size of layers across CNN configurations.
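To make the idea of estimating resource usage from a layer descriptor concrete, the following is a minimal sketch of a textbook cost count for a single convolutional layer. It is not Augur's model, which this record does not describe in detail; the names `ConvLayer` and `conv_cost`, the 4-byte value size, and the choice to count only weights and output activations as memory are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class ConvLayer:
    """Hypothetical descriptor for one square-kernel convolutional layer."""
    in_channels: int
    out_channels: int
    kernel: int
    stride: int
    padding: int


def conv_cost(layer: ConvLayer, in_h: int, in_w: int, bytes_per_value: int = 4) -> dict:
    """Return an analytic estimate of work and memory for one conv layer.

    Assumption: memory counts only the weights, biases, and the output
    activations; real platforms also hold inputs and scratch buffers.
    """
    # Standard output-size formula for a strided, padded convolution.
    out_h = (in_h + 2 * layer.padding - layer.kernel) // layer.stride + 1
    out_w = (in_w + 2 * layer.padding - layer.kernel) // layer.stride + 1

    # One multiply-accumulate per kernel element, per input channel,
    # per output element.
    macs = out_h * out_w * layer.out_channels * layer.in_channels * layer.kernel ** 2

    # Weights plus one bias per output channel.
    params = layer.out_channels * (layer.in_channels * layer.kernel ** 2 + 1)

    activations = out_h * out_w * layer.out_channels
    return {
        "out_shape": (layer.out_channels, out_h, out_w),
        "macs": macs,
        "params": params,
        "memory_bytes": (params + activations) * bytes_per_value,
    }


if __name__ == "__main__":
    # Example: an 11x11, stride-4 first layer on a 227x227 RGB input.
    print(conv_cost(ConvLayer(3, 96, 11, 4, 0), 227, 227))
```

Summing such per-layer counts over a whole descriptor gives a rough platform-independent figure; turning it into a time estimate would still require per-device measurements of the kind the abstract describes.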
Pages: 1663-1671
Number of pages: 9