Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices

Cited by: 41
Authors
Lu, Zongqing [1]
Rallapalli, Swati [2]
Chan, Kevin [3]
La Porta, Thomas [4]
Affiliations
[1] Peking Univ, Beijing, Peoples R China
[2] IBM Res, Yorktown Hts, NY USA
[3] Army Res Lab, Washington, DC USA
[4] Penn State Univ, University Pk, PA 16802 USA
Keywords
Convolutional neural networks; modeling; mobile devices;
DOI
10.1145/3123266.3123389
Chinese Library Classification (CLC) number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
Convolutional Neural Networks (CNNs) have revolutionized the research in computer vision, due to their ability to capture complex patterns, resulting in high inference accuracies. However, the increasingly complex nature of these neural networks means that they are particularly suited for server computers with powerful GPUs. We envision that deep learning applications will be eventually and widely deployed on mobile devices, e.g., smartphones, self-driving cars, and drones. Therefore, in this paper, we aim to understand the resource requirements (time, memory) of CNNs on mobile devices. First, by deploying several popular CNNs on mobile CPUs and GPUs, we measure and analyze the performance and resource usage for every layer of the CNNs. Our findings point out the potential ways of optimizing the performance on mobile devices. Second, we model the resource requirements of the different CNN computations. Finally, based on the measurement, profiling, and modeling, we build and evaluate our modeling tool, Augur, which takes a CNN configuration (descriptor) as the input and estimates the compute time and resource usage of the CNN, to give insights about whether and how efficiently a CNN can be run on a given mobile platform. In doing so Augur tackles several challenges: (i) how to overcome profiling and measurement overhead; (ii) how to capture the variance in different mobile platforms with different processors, memory, and cache sizes; and (iii) how to account for the variance in the number, type and size of layers of the different CNN configurations.
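As a rough illustration of estimating resource needs from a layer descriptor, the sketch below counts multiply-accumulate operations and memory for a single convolutional layer. It is not Augur's model, which the abstract does not specify; the `ConvLayer` fields, the `estimate_conv` function, the square-input assumption, and the 4-byte value size are all hypothetical choices made for this example.

```python
from dataclasses import dataclass


@dataclass
class ConvLayer:
    """Hypothetical descriptor for one square convolutional layer."""
    in_channels: int
    out_channels: int
    kernel: int
    stride: int = 1
    padding: int = 0


def conv_output_size(in_size: int, layer: ConvLayer) -> int:
    # Standard output-size formula for a convolution without dilation.
    return (in_size + 2 * layer.padding - layer.kernel) // layer.stride + 1


def estimate_conv(layer: ConvLayer, in_size: int, bytes_per_value: int = 4) -> dict:
    """Estimate work and memory for one layer on a square in_size x in_size input.

    Assumes dense convolution, one bias per output channel, and
    bytes_per_value bytes per stored number (4 for 32-bit floats).
    """
    out_size = conv_output_size(in_size, layer)
    # One multiply-accumulate per kernel element, per input channel,
    # per output position, per output channel.
    macs = (out_size * out_size * layer.out_channels
            * layer.in_channels * layer.kernel * layer.kernel)
    # Weights plus one bias for each output channel.
    params = (layer.in_channels * layer.kernel * layer.kernel + 1) * layer.out_channels
    return {
        "out_size": out_size,
        "macs": macs,
        "weight_bytes": params * bytes_per_value,
        "activation_bytes": out_size * out_size * layer.out_channels * bytes_per_value,
    }


# Illustrative layer: 3 input channels, 96 filters of 11x11, stride 4, 227x227 input.
example = estimate_conv(ConvLayer(3, 96, kernel=11, stride=4), in_size=227)
print(example)
```

Counts like these capture only the arithmetic and storage implied by the configuration. Turning them into a time estimate for a particular device would additionally need measured per-platform behavior, which is the part the abstract attributes to profiling and measurement.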
Pages: 1663 - 1671
Page count: 9
Related papers
50 in total
  • [21] Mobile Convolutional Neural Networks for Facial Expression Recognition
    Yoon, ChangRak
    Kim, DoHyun
    [J]. 11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 1315 - 1317
  • [22] Using Convolutional Neural Network for Image Enhancement on Mobile Devices
    Despotovici, Mihai
    Mocanu, Irina
    Lucia, Rusu
    [J]. 2019 IEEE 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2019), 2019, : 359 - 364
  • [23] Merging Deep Neural Networks for Mobile Devices
    Chou, Yi-Min
    Chan, Yi-Ming
    Lee, Jia-Hong
    Chiu, Chih-Yi
    Chen, Chu-Song
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 1767 - 1775
  • [24] Real-Time Physical Activity Recognition on Smart Mobile Devices Using Convolutional Neural Networks
    Peppas, Konstantinos
    Tsolakis, Apostolos C.
    Krinidis, Stelios
    Tzovaras, Dimitrios
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (23): : 1 - 25
  • [25] Parallel Recurrent Convolutional Neural Networks-Based Music Genre Classification Method for Mobile Devices
    Yang, Rui
    Feng, Lin
    Wang, Huibing
    Yao, Jianing
    Luo, Sen
    [J]. IEEE ACCESS, 2020, 8 : 19629 - 19637
  • [26] Reducing Memory Requirements of Convolutional Neural Networks for Inference at the Edge
    Bravenec, Tomas
    Fryza, Tomas
    [J]. 2021 31ST INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2021,
  • [27] Cloud-assisted collaborative inference of convolutional neural networks for vision tasks on resource-constrained devices
    Rodriguez-Conde, Ivan
    Campos, Celso
    Fdez-Riverola, Florentino
    [J]. NEUROCOMPUTING, 2023, 560
  • [28] Resource Efficient 3D Convolutional Neural Networks
    Koepueklue, Okan
    Kose, Neslihan
    Gunduz, Ahmet
    Rigoll, Gerhard
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1910 - 1919
  • [29] INVESTIGATIONS ON BYTE-LEVEL CONVOLUTIONAL NEURAL NETWORKS FOR LANGUAGE MODELING IN LOW RESOURCE SPEECH RECOGNITION
    Irie, Kazuki
    Golik, Pavel
    Schlueter, Ralf
    Ney, Hermann
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5740 - 5744
  • [30] Performance comparison of wake-up-word detection on mobile devices using various convolutional neural networks
    Kim, Sanghong
    Lee, Bowon
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (05): : 454 - 460