Evaluating Robustness to Noise and Compression of Deep Neural Networks for Keyword Spotting

Cited by: 2

Authors:

Pereira, Pedro H. [1]

Beccaro, Wesley [1]

Ramirez, Miguel A. [1]

Affiliations:

[1] Univ Sao Paulo, Dept Elect Syst Engn, Escola Politecn, BR-05508010 Sao Paulo, Brazil

Source:

IEEE ACCESS | 2023 / Vol. 11

Funding:

São Paulo Research Foundation (FAPESP), Brazil;

Keywords:

Speech recognition; machine learning algorithms; speech analysis; spectral analysis; pruning; quantization; keyword spotting; RECOGNITION; ALGORITHM;

DOI:

10.1109/ACCESS.2023.3280477

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

Keyword spotting (KWS) has attracted considerable research in recent years, driven by the growing number of embedded systems for command recognition, such as Alexa, Google Home, and Siri. Performance, model size, processing time, and robustness to noise are fundamental in these systems. Furthermore, embedded applications demand computationally efficient models that can be implemented with current technology. In this work, an approach to keyword recognition is evaluated using three deep learning models, namely LeNet-5, SqueezeNet, and EfficientNet-B0. We evaluate transfer learning, pruning, and quantization strategies in training and testing, using noisy and clean speech signals. In addition, the compression techniques (pruning and quantization) are assessed in terms of the reduction in model footprint and the accuracy obtained in each case. Using Google's Speech Commands dataset and an additive babble noise signal, our keyword recognition approach achieves an accuracy of 94.6% with unstructured pruning of 80% of the parameters of the original SqueezeNet network, reducing the model size by 70%.
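
As a rough illustration of what "unstructured pruning of 80% of the parameters" can mean, the sketch below zeroes the 80% of weights with the smallest magnitude. It is not the authors' implementation: the magnitude criterion, the function name magnitude_prune, and the random stand-in weights are all assumptions made for this example, and the abstract does not state which pruning criterion was used.

```python
import random


def magnitude_prune(weights, sparsity):
    """Return a copy of `weights` with the smallest-magnitude fraction set to zero.

    Hypothetical helper for illustration only; pruning by magnitude is an
    assumption, not a detail taken from the abstract.
    """
    k = int(round(sparsity * len(weights)))  # number of weights to zero out
    # Indices sorted from smallest to largest absolute value
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned_idx = set(order[:k])
    return [0.0 if i in pruned_idx else w for i, w in enumerate(weights)]


# Random stand-in for one layer's flattened weights (not real model data)
random.seed(0)
weights = [random.gauss(0.0, 1.0) for _ in range(1000)]

pruned = magnitude_prune(weights, 0.80)
zero_fraction = sum(1 for w in pruned if w == 0.0) / len(pruned)
print(f"fraction of zeroed weights: {zero_fraction:.2f}")
```

Zeroing weights does not shrink a stored model by itself; a smaller footprint generally depends on sparse storage or compression of the pruned weights, which may be why the reported size reduction (70%) differs from the pruning ratio (80%).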

Pages: 53224-53236

Page count: 13
