Efficient deep neural network compression for environmental sound classification on microcontroller units

Cited by: 0
Authors
Chen, Shan [1]
Meng, Na [1]
Li, Haoyuan [1]
Fang, Weiwei [1, 2]
Affiliations
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
[2] Hubei Engn Res Ctr Intelligent Detect & Identifica, Wuhan, Hubei, Peoples R China
Funding
National Science Foundation (United States)
Keywords
Environmental sound classification; deep neural networks; microcontroller units; knowledge distillation
DOI
10.55730/1300-0632.4084
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Environmental sound classification (ESC) is one of the important research topics within the nonspeech audio classification field. While deep neural networks (DNNs) have achieved significant advances in ESC recently, their high computational and memory demands render them highly unsuitable for direct deployment on resource-constrained Internet of Things (IoT) devices based on microcontroller units (MCUs). To address this challenge, we propose a novel DNN compression framework specifically designed for such devices. On the one hand, we leverage pruning techniques to significantly compress the large number of model parameters in DNNs. To reduce the accuracy loss that follows pruning, we propose a knowledge distillation scheme based on feature information from multiple intermediate layers. On the other hand, we design a two-stage quantization-aware knowledge distillation scheme to mitigate the accuracy degradation of mandatory quantization required by MCU hardware. We evaluate our framework on benchmark ESC datasets (UrbanSound8K, ESC-50) using the STM32F746ZG device. The experimental results demonstrate that our framework can achieve compression rates up to 97% while maintaining competitive inference performance compared to the uncompressed baseline.
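The three ingredients named in the abstract (pruning, distillation on intermediate-layer features, and quantization) can be illustrated with a minimal, generic sketch. This is not the authors' implementation: the function names, the magnitude-based pruning criterion, the mean-squared-error feature loss, and the symmetric per-tensor int8 scheme are all assumptions chosen for illustration, and the abstract does not specify any of them.

```python
"""Generic illustrations of pruning, feature distillation, and int8 quantization.

All names and design choices below are hypothetical, not taken from the paper.
"""


def magnitude_prune(weights, sparsity):
    """Zero out the `sparsity` fraction of weights with the smallest magnitude."""
    k = int(len(weights) * sparsity)  # number of weights to remove
    if k == 0:
        return list(weights)
    # The k-th smallest magnitude serves as the pruning threshold.
    threshold = sorted(abs(w) for w in weights)[k - 1]
    pruned, removed = [], 0
    for w in weights:
        if removed < k and abs(w) <= threshold:
            pruned.append(0.0)
            removed += 1
        else:
            pruned.append(w)
    return pruned


def feature_distillation_loss(student_feats, teacher_feats):
    """Average, over several layers, the MSE between student and teacher features.

    Each argument is a list of layers; each layer is a flat list of floats.
    Matching layers are assumed to have equal length.
    """
    per_layer = []
    for s, t in zip(student_feats, teacher_feats):
        mse = sum((a - b) ** 2 for a, b in zip(s, t)) / len(s)
        per_layer.append(mse)
    return sum(per_layer) / len(per_layer)


def quantize_int8(weights):
    """Symmetric per-tensor quantization to the integer range [-127, 127]."""
    max_abs = max((abs(w) for w in weights), default=0.0) or 1.0
    scale = max_abs / 127.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Map int8 codes back to approximate float values."""
    return [v * scale for v in q]


if __name__ == "__main__":
    w = [0.5, -0.1, 0.03, -0.8, 0.2, 0.01, -0.4, 0.05, 0.9, -0.02]
    w_pruned = magnitude_prune(w, sparsity=0.5)
    q, scale = quantize_int8(w_pruned)
    print(w_pruned)
    print(q, scale)
    print(feature_distillation_loss([[1.0, 2.0], [0.0]], [[1.0, 0.0], [1.0]]))
```

In a quantization-aware setting, the dequantized weights would stand in for the float weights during training so that the student learns under the rounding error it will face on the MCU; how the paper's two stages divide this work is not stated in the abstract.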
Pages: 16