Compact Feedforward Sequential Memory Networks for Small-footprint Keyword Spotting

Cited by: 6
Authors
Chen, Mengzhe [1]
Zhang, Shiliang [1]
Lei, Ming [1]
Liu, Yong [1]
Yao, Haitao [1]
Gao, Jie [1]
Affiliations
[1] Alibaba Group, Hangzhou, Zhejiang, People's Republic of China
Keywords
keyword spotting; compact feedforward sequential memory network; multiframe prediction; small-footprint
DOI
10.21437/Interspeech.2018-1204
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Because device resources are limited and usage scenarios are complicated, small-footprint keyword spotting calls for a compact model with high precision, low computational cost, and low latency. To meet these requirements, this paper investigates the compact Feedforward Sequential Memory Network (cFSMN), which combines low-rank matrix factorization with the conventional FSMN, on a far-field keyword spotting task. The effect of its architecture parameters is analyzed. To lower the computational cost further, multiframe prediction (MFP) is applied to cFSMN. To enhance the modeling capacity, an advanced MFP is attempted by inserting small DNN layers before the output layers. Performance is measured by the area under the curve (AUC) of detection error tradeoff (DET) curves. The experiments show that, compared with a well-tuned long short-term memory (LSTM) network that has the same latency and twice the computational cost, the cFSMN achieves relative AUC decreases of 18.11% and 29.21% on test sets recorded in quiet and noisy environments, respectively. After applying advanced MFP, the system achieves relative AUC decreases of 0.48% and 20.04% over the conventional cFSMN on the quiet and noisy test sets, respectively, while the computational cost is reduced by a relative 46.58%.
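The abstract's two quantitative ideas, the saving from low-rank matrix factorization and the "relative decrease" figures, can be illustrated with a minimal sketch. The layer sizes and rank below are hypothetical and are not taken from the paper; the function names are likewise illustrative. Factorizing a rows × cols weight matrix into a rows × rank matrix times a rank × cols matrix replaces rows·cols parameters with rank·(rows + cols).

```python
def dense_params(rows: int, cols: int) -> int:
    """Parameter count of a full rows x cols weight matrix (bias ignored)."""
    return rows * cols


def low_rank_params(rows: int, cols: int, rank: int) -> int:
    """Parameter count after factorizing the matrix as (rows x rank) @ (rank x cols)."""
    return rank * (rows + cols)


def relative_reduction(baseline: float, new: float) -> float:
    """Relative decrease of `new` with respect to `baseline`, in percent."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return 100.0 * (baseline - new) / baseline


if __name__ == "__main__":
    # Hypothetical sizes chosen for illustration only (not from the paper).
    full = dense_params(1024, 1024)
    factored = low_rank_params(1024, 1024, rank=128)
    saving = relative_reduction(full, factored)
    print(f"full: {full}, factored: {factored}, relative reduction: {saving:.2f}%")
```

The same `relative_reduction` formula is the usual reading of the relative AUC and computational-cost decreases quoted in the abstract, assuming the baseline is the system being compared against.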
Pages: 2663-2667
Number of pages: 5
Related papers
50 in total
  • [31] Depthwise Separable Convolutional ResNet with Squeeze-and-Excitation Blocks for Small-footprint Keyword Spotting
    Xu, Menglong
    Zhang, Xiao-Lei
    [J]. INTERSPEECH 2020, 2020, : 2547 - 2551
  • [32] An empirical study of cross-lingual transfer learning techniques for small-footprint keyword spotting
    Sun, Ming
    Schwarz, Andreas
    Wu, Minhua
    Strom, Nikko
    Matsoukas, Spyros
    Vitaladevuni, Shiv
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 255 - 260
  • [33] Small-Footprint Keyword Spotting Based on Gated Channel Transformation Sandglass Residual Neural Network
    Zhang, Ying
    Zhu, Shirong
    Yu, Chao
    Zhao, Lasheng
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (07)
  • [34] Joint Framework of Curriculum Learning and Knowledge Distillation for Noise-Robust and Small-Footprint Keyword Spotting
    Lim, Jaebong
    Baek, Yunju
    [J]. IEEE ACCESS, 2023, 11 : 100540 - 100553
  • [35] Multi-class AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data
    Xu, Menglong
    Li, Shengqiang
    Liang, Chengdong
    Zhang, Xiao-Lei
    [J]. INTERSPEECH 2022, 2022, : 3278 - 3282
  • [36] A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting
    Bai, Ye
    Yi, Jiangyan
    Tao, Jianhua
    Wen, Zhengqi
    Tian, Zhengkun
    Zhao, Chenghao
    Fan, Cunhang
    [J]. INTERSPEECH 2019, 2019, : 2190 - 2194
  • [37] Robust Small-Footprint Keyword Spotting Using Sequence-To-Sequence Model With Connectionist Temporal Classifier
    Xuan, Xiaoguang
    Wang, Mingjiang
    Zhang, Xin
    Sun, Fengjiao
    [J]. 2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 400 - 404
  • [38] ADVERSARIAL EXAMPLES FOR IMPROVING END-TO-END ATTENTION-BASED SMALL-FOOTPRINT KEYWORD SPOTTING
    Wang, Xiong
    Sun, Sining
    Shan, Changhao
    Hou, Jingyong
    Xie, Lei
    Li, Shen
    Lei, Xin
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6366 - 6370
  • [39] Selective transfer subspace learning for small-footprint end-to-end cross-domain keyword spotting
    Ma, Fei
    Wang, Chengliang
    Li, Xusheng
    Zeng, Zhuo
    [J]. SPEECH COMMUNICATION, 2024, 156
  • [40] Low-complex and Highly-performed Binary Residual Neural Network for Small-footprint Keyword Spotting
    Wang, Xiao
    Cheng, Song
    Li, Jun
    Qiao, Shushan
    Zhou, Yumei
    Zhan, Yi
    [J]. INTERSPEECH 2022, 2022, : 3233 - 3237