Evaluating Robustness to Noise and Compression of Deep Neural Networks for Keyword Spotting

被引:2
|
作者
Pereira, Pedro H. [1 ]
Beccaro, Wesley [1 ]
Ramirez, Miguel A. [1 ]
机构
[1] Univ Sao Paulo, Dept Elect Syst Engn, Escola Politecn, BR-05508010 Sao Paulo, Brazil
基金
巴西圣保罗研究基金会;
关键词
Speech recognition; machine learning algorithms; speech analysis; spectral analysis; pruning; quantization; keyword spotting; RECOGNITION; ALGORITHM;
D O I
10.1109/ACCESS.2023.3280477
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Keyword Spotting (KWS) has been the subject of research in recent years given the increase of embedded systems for command recognition such as Alexa, Google Home, and Siri. Performance, model size, processing time, and robustness to noise are fundamental in these systems. Furthermore, applications in embedded systems demand computationally efficient models that can be implemented in current technology. In this work, an approach for keyword recognition is evaluated using three deep learning models namely LeNet-5, SqueezeNet, and EfficientNet-B0. We evaluate transfer learning, pruning and quantization strategies in training and test using noisy and clean speech signals. In addition, compression techniques such as pruning and quantization were assessed in terms of the size reduction of the model footprint and the accuracy obtained in each case. Using the Google's Speech Commands dataset and additive babble noise signal, our keyword recognition approach achieves an accuracy of 94.6% using an unstructured pruning of 80% of the parameters of the original SqueezeNet network with a reduction of 70% in the model size.
引用
收藏
页码:53224 / 53236
页数:13
相关论文
共 50 条
  • [1] Deep Convolutional Spiking Neural Networks for Keyword Spotting
    Yilmaz, Emre
    Gevrek, Ozgur Bora
    Wu, Jibin
    Chen, Yuxiang
    Meng, Xuanbo
    Li, Haizhou
    [J]. INTERSPEECH 2020, 2020, : 2557 - 2561
  • [2] SMALL-FOOTPRINT KEYWORD SPOTTING USING DEEP NEURAL NETWORKS
    Chen, Guoguo
    Parada, Carolina
    Heigold, Georg
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [3] Handwritten keyword spotting using deep neural networks and certainty prediction
    Daraee, Fatemeh
    Mozaffari, Saeed
    Razavi, Seyyed Mohammad
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2021, 92
  • [4] Analyzing the Noise Robustness of Deep Neural Networks
    Liu, Mengchen
    Liu, Shixia
    Su, Hang
    Cao, Kelei
    Zhu, Jun
    [J]. 2018 IEEE CONFERENCE ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY (VAST), 2018, : 60 - 71
  • [5] Analyzing the Noise Robustness of Deep Neural Networks
    Cao, Kelei
    Liu, Mengchen
    Su, Hang
    Wu, Jing
    Zhu, Jun
    Liu, Shixia
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (07) : 3289 - 3304
  • [6] Evaluating the Robustness of Ultrasound Beamforming with Deep Neural Networks
    Luchies, Adam
    Byram, Brett
    [J]. 2018 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2018,
  • [7] Non-Uniform Boosted MCE Training of Deep Neural Networks for Keyword Spotting
    Meng, Zhong
    Juang, Biing-Hwang
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 770 - 774
  • [8] An application of recurrent neural networks to discriminative keyword spotting
    Fernandez, Santiago
    Graves, Alex
    Schmidhuber, Juergen
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2007, PT 2, PROCEEDINGS, 2007, 4669 : 220 - +
  • [9] Enhancing the Robustness of Deep Neural Networks from "Smart" Compression
    Liu, Tao
    Liu, Zihao
    Liu, Qi
    Wen, Wujie
    [J]. 2018 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI), 2018, : 528 - 532
  • [10] Efficient Keyword Spotting through Hardware-Aware Conditional Execution of Deep Neural Networks
    Giraldo, J. S. P.
    O'Connor, Chris
    Verhelst, Marian
    [J]. 2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,