SMALL-FOOTPRINT KEYWORD SPOTTING USING DEEP NEURAL NETWORKS

被引:0
|
作者
Chen, Guoguo [1 ]
Parada, Carolina [2 ]
Heigold, Georg [2 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
[2] Google Inc, Mountain View, CA USA
关键词
Deep Neural Network; Keyword Spotting; Embedded Speech Recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Our application requires a keyword spotting system with a small memory footprint, low computational cost, and high precision. To meet these requirements, we propose a simple approach based on deep neural networks. A deep neural network is trained to directly predict the keyword(s) or subword units of the keyword(s) followed by a posterior handling method producing a final confidence score. Keyword recognition results achieve 45% relative improvement with respect to a competitive Hidden Markov Model-based system, while performance in the presence of babble noise shows 39% relative improvement.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Error-Diffusion Based Speech Feature Quantization for Small-Footprint Keyword Spotting
    Luo, Mengjie
    Wang, Dingyi
    Wang, Xiaoqin
    Qiao, Shushan
    Zhou, Yumei
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1357 - 1361
  • [32] Attention-based End-to-End Models for Small-Footprint Keyword Spotting
    Shan, Changhao
    Zhang, Junbo
    Wang, Yujun
    Xie, Lei
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2037 - 2041
  • [33] Improved Small-Footprint ASR-Based Solution for Open Vocabulary Keyword Spotting
    Pudo, Mikolaj
    Wosik, Mateusz
    Janicki, Artur
    [J]. IEEE ACCESS, 2024, 12 : 91289 - 91299
  • [34] Small-footprint Deep Neural Networks with Highway Connections for Speech Recognition
    Lu, Liang
    Renals, Steve
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 12 - 16
  • [35] VIRTUAL ADVERSARIAL TRAINING FOR DS-CNN BASED SMALL-FOOTPRINT KEYWORD SPOTTING
    Wang, Xiong
    Sun, Sining
    Xie, Lei
    [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 607 - 612
  • [36] MAX-POOLING LOSS TRAINING OF LONG SHORT-TERM MEMORY NETWORKS FOR SMALL-FOOTPRINT KEYWORD SPOTTING
    Sun, Ming
    Raju, Anirudh
    Tucker, George
    Panchapagesan, Sankaran
    Fu, Gengshen
    Mandal, Arindam
    Matsoukas, Spyros
    Strom, Nikko
    Vitaladevuni, Shiv
    [J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 474 - 480
  • [37] FOCAL LOSS AND DOUBLE-EDGE-TRIGGERED DETECTOR FOR ROBUST SMALL-FOOTPRINT KEYWORD SPOTTING
    Liu, Bin
    Nie, Shuai
    Zhang, Yaping
    Liang, Shan
    Yang, Zhanlei
    Liu, Wenju
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6361 - 6365
  • [38] Handwritten keyword spotting using deep neural networks and certainty prediction
    Daraee, Fatemeh
    Mozaffari, Saeed
    Razavi, Seyyed Mohammad
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2021, 92
  • [39] Depthwise Separable Convolutional ResNet with Squeeze-and-Excitation Blocks for Small-footprint Keyword Spotting
    Xu, Menglong
    Zhang, Xiao-Lei
    [J]. INTERSPEECH 2020, 2020, : 2547 - 2551
  • [40] An empirical study of cross-lingual transfer learning techniques for small-footprint keyword spotting
    Sun, Ming
    Schwarz, Andreas
    Wu, Minhua
    Strom, Nikko
    Matsoukas, Spyros
    Vitaladevuni, Shiv
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 255 - 260