Efficient deep neural network compression for environmental sound classification on microcontroller units

Cited by: 0
Authors
Chen, Shan [1]
Meng, Na [1]
Li, Haoyuan [1]
Fang, Weiwei [1, 2]
Affiliations
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
[2] Hubei Engn Res Ctr Intelligent Detect & Identifica, Wuhan, Hubei, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Environmental sound classification; deep neural networks; microcontroller units; knowledge distillation;
DOI
10.55730/1300-0632.4084
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Environmental sound classification (ESC) is one of the important research topics within the nonspeech audio classification field. While deep neural networks (DNNs) have achieved significant advances in ESC recently, their high computational and memory demands render them highly unsuitable for direct deployment on resource-constrained Internet of Things (IoT) devices based on microcontroller units (MCUs). To address this challenge, we propose a novel DNN compression framework specifically designed for such devices. On the one hand, we leverage pruning techniques to significantly compress the large number of model parameters in DNNs. To reduce the accuracy loss that follows pruning, we propose a knowledge distillation scheme based on feature information from multiple intermediate layers. On the other hand, we design a two-stage quantization-aware knowledge distillation scheme to mitigate the accuracy degradation of mandatory quantization required by MCU hardware. We evaluate our framework on benchmark ESC datasets (UrbanSound8K, ESC-50) using the STM32F746ZG device. The experimental results demonstrate that our framework can achieve compression rates up to 97% while maintaining competitive inference performance compared to the uncompressed baseline.
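As an illustration of the three ingredients the abstract names (pruning, distillation from intermediate features, and quantization), the following is a minimal pure-Python sketch. It is not the authors' framework: the function names, the magnitude-based pruning criterion, the symmetric int8 scheme, and the `temperature` and `alpha` values are all assumptions chosen for illustration, and the two-stage quantization-aware training loop is not reproduced here.

```python
import math

def magnitude_prune(weights, sparsity):
    """Zero out the `sparsity` fraction of weights with the smallest magnitude.
    (Magnitude pruning is an assumed criterion, not taken from the paper.)"""
    k = int(len(weights) * sparsity)  # number of weights to remove
    if k == 0:
        return list(weights)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned_idx = set(order[:k])       # indices of the k smallest magnitudes
    return [0.0 if i in pruned_idx else w for i, w in enumerate(weights)]

def quantize_int8(weights):
    """Symmetric per-tensor quantization to the int8 range [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale

def softmax(logits, temperature=1.0):
    """Numerically stable softmax with a temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits,
                      student_feats, teacher_feats,
                      temperature=4.0, alpha=0.5):
    """KL(teacher || student) on softened outputs, plus the mean squared
    error over paired intermediate-layer features (hypothetical weighting)."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    pairs = [(s, t)
             for sf, tf in zip(student_feats, teacher_feats)
             for s, t in zip(sf, tf)]
    mse = sum((s - t) ** 2 for s, t in pairs) / len(pairs)
    return alpha * (temperature ** 2) * kl + (1 - alpha) * mse

# Toy usage: prune half of a small weight vector, then quantize what remains.
weights = [0.8, -0.05, 0.02, -0.6, 0.01, 0.3, -0.02, 0.9, 0.04, -0.01]
pruned = magnitude_prune(weights, sparsity=0.5)
q_weights, scale = quantize_int8(pruned)
```

In a training loop, a loss of this shape would be evaluated on the pruned (and, later, quantized) student against the uncompressed teacher; how the paper selects layers and schedules its two stages is described only at a high level in the abstract.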
Pages: 16
Related papers
50 in total
  • [1] Deep Convolutional Neural Network with Mixup for Environmental Sound Classification
    Zhang, Zhichao
    Xu, Shugong
    Cao, Shan
    Zhang, Shunqing
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 356 - 367
  • [2] Deep Convolutional Neural Network with Transfer Learning for Environmental Sound Classification
    Lu, Jianrui
    Ma, Ruofei
    Liu, Gongliang
    Qin, Zhiliang
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS (ICCCR 2021), 2021 : 242 - 245
  • [3] Deep convolutional neural network for environmental sound classification via dilation
    Roy, Sanjiban Sekhar
    Mihalache, Sanda Florentina
    Pricop, Emil
    Rodrigues, Nishant
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (02) : 1827 - 1833
  • [4] Deep Convolutional Neural Network Combined with Concatenated Spectrogram for Environmental Sound Classification
    Chi, Zhejian
    Li, Ying
    Chen, Cheng
    [J]. PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019 : 251 - 254
  • [5] Environmental sound classification using a regularized deep convolutional neural network with data augmentation
    Mushtaq, Zohaib
    Su, Shun-Feng
    [J]. APPLIED ACOUSTICS, 2020, 167
  • [6] Efficient Tunstall Decoder for Deep Neural Network Compression
    Chen, Chunyun
    Wang, Zhe
    Chen, Xiaowei
    Lin, Jie
    Aly, Mohamed M. Sabry
    [J]. 2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021 : 1021 - 1026
  • [7] Dilated Convolution Neural Network with LeakyReLU for Environmental Sound Classification
    Zhang, Xiaohu
    Zou, Yuexian
    Shi, Wei
    [J]. 2017 22ND INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2017
  • [8] Deep artificial neural network based on environmental sound data for the generation of a children activity classification model
    Garcia-Dominguez, Antonio
    Galvan-Tejada, Carlos E.
    Zanella-Calzada, Laura A.
    Gamboa, Hamurabi
    Galvan-Tejada, Jorge I.
    Celaya Padilla, Jose Maria
    Luna-Garcia, Huizilopoztli
    Arceo-Olague, Jose G.
    Magallanes-Quintanar, Rafael
    [J]. PEERJ COMPUTER SCIENCE, 2020, PeerJ Inc. (06)
  • [9] Sound Classification Using Convolutional Neural Network and Tensor Deep Stacking Network
    Khamparia, Aditya
    Gupta, Deepak
    Nhu Gia Nguyen
    Khanna, Ashish
    Pandey, Babita
    Tiwari, Prayag
    [J]. IEEE ACCESS, 2019, 7 : 7717 - 7727
  • [10] Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification
    Salamon, Justin
    Bello, Juan Pablo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (03) : 279 - 283