Error-Diffusion Based Speech Feature Quantization for Small-Footprint Keyword Spotting

Cited by: 1
Authors
Luo, Mengjie [1, 2]
Wang, Dingyi [1]
Wang, Xiaoqin [1, 2]
Qiao, Shushan [1, 2]
Zhou, Yumei [1, 2]
Affiliations
[1] Chinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100029, Peoples R China
Keywords
Quantization (signal); Task analysis; Spectrogram; Signal processing algorithms; Filter banks; Standards; Speech processing; Keyword spotting; Speech feature quantization; Error diffusion; Image processing; Convolutional neural networks
DOI
10.1109/LSP.2022.3179208
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract
Neural-network-based keyword spotting (KWS) systems are a critical component of user interaction in current smart devices. Although small-footprint networks have been widely explored to reduce deployment overhead, low-precision input feature representation has not been studied in depth. In this letter, an error-diffusion-based speech feature quantization method is proposed. Specifically, our algorithm adapts an image-processing technique to quantize the input speech feature maps to an arbitrary number of bits. Experiments show that in the 10-keyword KWS task, our 3-bit representation causes only a 0.45% average accuracy drop compared to full-precision log-Mel spectrograms, while other methods drop by over 3%. In the 2-keyword task, our 3-bit representation produces no significant difference, while 1-bit quantization leads to an average accuracy drop of only 1.7% and can still handle similar keywords and imbalanced data distributions. To the best of our knowledge, these results make our method the first practical one to support quantization as low as 1 bit for single-channel speech features in small-footprint KWS. In addition, we analyze the impact of the error-diffusion direction and conclude that time-direction diffusion is more suitable for temporal convolutional networks.
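The abstract does not give the diffusion kernel or level mapping used in the letter, so the following is only a minimal sketch of the general idea, under stated assumptions: uniform quantization levels over the feature map's own min-max range, and a single-tap kernel that carries each cell's rounding error to the next frame of the same mel bin (time-direction diffusion). The function name `error_diffusion_quantize` and the `[mel_bin][frame]` layout are hypothetical, chosen for illustration.

```python
def error_diffusion_quantize(features, bits=3):
    """Quantize a non-empty [mel_bin][frame] feature map to 2**bits levels.

    Illustrative assumption, not the letter's exact algorithm: the rounding
    error of each cell is added to the next frame of the same mel bin.
    """
    flat = [v for row in features for v in row]
    lo, hi = min(flat), max(flat)
    top = 2 ** bits - 1                      # highest integer code
    scale = top / (hi - lo) if hi > lo else 0.0
    codes = []
    for row in features:
        carry = 0.0                          # error diffused from the previous frame
        out = []
        for v in row:
            target = (v - lo) * scale + carry
            q = min(max(round(target), 0), top)
            carry = target - q               # pass the residual forward in time
            out.append(q)
        codes.append(out)
    return codes


# One mel bin, six frames, 1-bit quantization.
frames = [[0.0, 0.4, 0.4, 0.4, 0.4, 1.0]]
print(error_diffusion_quantize(frames, bits=1))  # [[0, 0, 1, 0, 1, 1]]
```

Plain rounding would map the four 0.4 frames to 0 and lose their energy entirely; carrying the error forward keeps the local average of the 1-bit codes close to the original values, which is the property that makes very low bit widths plausible.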
Pages: 1357-1361
Page count: 5