Human Activity Recognition on Microcontrollers with Quantized and Adaptive Deep Neural Networks

Cited by: 18
Authors
Daghero, Francesco [1]
Burrello, Alessio [2]
Xie, Chen [1]
Castellano, Marco [3]
Gandolfi, Luca [3]
Calimera, Andrea [1]
Macii, Enrico [1]
Poncino, Massimo [1]
Pagliari, Daniele Jahier [1]
Affiliations
[1] Politecnico di Torino, I-10129 Turin, Italy
[2] University of Bologna, I-40136 Bologna, Italy
[3] STMicroelectronics, I-20010 Cornaredo, Italy
Keywords
Quantized neural networks; mixed precision; adaptive neural networks; human activity recognition; edge computing; energy efficiency; low-power; hybrid; sensor
DOI
10.1145/3542819
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Human Activity Recognition (HAR) based on inertial data is an increasingly common task on embedded devices, from smartphones to ultra-low-power sensors. Due to the high computational complexity of deep learning models, most embedded HAR systems are based on simple and not-so-accurate classic machine learning algorithms. This work bridges the gap between on-device HAR and deep learning, proposing a set of efficient one-dimensional Convolutional Neural Networks (CNNs) that can be deployed on general-purpose microcontrollers (MCUs). Our CNNs are obtained by combining hyper-parameter optimization with sub-byte and mixed-precision quantization, to find good trade-offs between classification results and memory occupation. Moreover, we also leverage adaptive inference as an orthogonal optimization to tune the inference complexity at runtime based on the processed input, hence producing a more flexible HAR system. With experiments on four datasets, and targeting an ultra-low-power RISC-V MCU, we show that (i) we are able to obtain a rich set of Pareto-optimal CNNs for HAR, spanning more than one order of magnitude in terms of memory, latency, and energy consumption; (ii) thanks to adaptive inference, we can derive >20 runtime operating modes starting from a single CNN, differing by up to 10% in classification scores and by more than 3x in inference complexity, with a limited memory overhead; (iii) on three of the four benchmarks, we outperform all previous deep learning methods, while reducing the memory occupation by more than 100x; the few methods that obtain better performance (both shallow and deep) are not compatible with MCU deployment; (iv) all our CNNs are compatible with real-time on-device HAR, achieving an inference latency that ranges between 9 μs and 16 ms. Their memory occupation ranges from 0.05 to 23.17 kB, and their energy consumption from 0.05 to 61.59 μJ, allowing years of continuous operation on a small battery supply.
Pages: 28