Deep Learning on Microcontrollers: A Study on Deployment Costs and Challenges

被引:6
|
作者
Svoboda, Filip [1 ]
Fernandez-Marques, Javier [2 ]
Liberis, Edgar [1 ]
Lane, Nicholas D. [1 ,2 ]
机构
[1] Univ Cambridge, Cambridge, England
[2] Samsung AI, Suwon, South Korea
基金
欧洲研究理事会; 英国工程与自然科学研究理事会;
关键词
microcontrollers; neural networks; compression; quantization;
D O I
10.1145/3517207.3526978
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Microcontrollers are an attractive deployment target due to their low cost, modest power usage and abundance in the wild. However, deploying models to such hardware is non-trivial due to a small amount of on-chip RAM (often < 512KB) and limited compute capabilities. In this work, we delve into the requirements and challenges of fast DNN inference on MCUs: we describe how the memory hierarchy influences the architecture of the model, expose often under-reported costs of compression and quantization techniques, and high-light issues that become critical when deploying to MCUs compared to mobiles. Our findings and experiences are also distilled into a set of guidelines that should ease the future deployment of DNN-based applications on microcontrollers.
引用
收藏
页码:54 / 63
页数:10
相关论文
共 50 条
  • [1] Quantization and Deployment of Deep Neural Networks on Microcontrollers
    Novac, Pierre-Emmanuel
    Boukli Hacene, Ghouthi
    Pegatoquet, Alain
    Miramond, Benoit
    Gripon, Vincent
    [J]. SENSORS, 2021, 21 (09)
  • [2] Deep Learning Challenges and Prospects in Wireless Sensor Network Deployment
    Qiu, Yaner
    Ma, Liyun
    Priyadarshi, Rahul
    [J]. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2024, 31 (6) : 3231 - 3254
  • [3] TinyOps: ImageNet Scale Deep Learning on Microcontrollers
    Sadiq, Sulaiman
    Hare, Jonathon
    Maji, Partha
    Craske, Simon
    Merrett, Geoff, V
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2701 - 2705
  • [4] Deep Learning Deployment
    Nirgudkar, Shailesh
    Modi, Sankalp
    [J]. PROCEEDINGS OF THE 2020 IEEE DALLAS CIRCUITS AND SYSTEMS CONFERENCE (DCAS 2020), 2020,
  • [5] Automated deep-learning model optimization framework for microcontrollers
    Hong, Seungtae
    Park, Gunju
    Kim, Jeong-Si
    [J]. ETRI JOURNAL, 2024,
  • [6] Deployment of APIs on Android Mobile Devices and Microcontrollers
    Laso, Sergio
    Linaje, Marino
    Garcia-Alonso, Jose
    Murillo, Juan M.
    Berrocal, Javier
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2020,
  • [7] An Empirical Study on Deployment Faults of Deep Learning Based Mobile Applications
    Chen, Zhenpeng
    Yao, Huihan
    Lou, Yiling
    Cao, Yanbin
    Liu, Yuanqiang
    Wang, Haoyu
    Liu, Xuanzhe
    [J]. 2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2021), 2021, : 674 - 685
  • [8] Prospective Deployment of Deep Learning inMRI: A Framework for Important Considerations, Challenges, and Recommendations for Best Practices
    Chaudhari, Akshay S.
    Sandino, Christopher M.
    Cole, Elizabeth K.
    Larson, David B.
    Gold, Garry E.
    Vasanawala, Shreyas S.
    Lungren, Matthew P.
    Hargreaves, Brian A.
    Langlotz, Curtis P.
    [J]. JOURNAL OF MAGNETIC RESONANCE IMAGING, 2021, 54 (02) : 357 - 371
  • [9] Faulty or Ready? Handling Failures in Deep-Learning Computer Vision Models until Deployment: A Study of Practices, Challenges, and Needs
    Balayn, Agathe
    Rikalo, Natasa
    Yang, Jie
    Bozzon, Alessandro
    [J]. PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2023), 2023,
  • [10] An Empirical Study of Challenges in Converting Deep Learning Models
    Openja, Moses
    Nikanjam, Amin
    Yahmed, Ahmed Haj
    Khomh, Foutse
    Jiang, Zhen Ming
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2022), 2022, : 13 - 23