An application of recurrent neural networks to discriminative keyword spotting

Cited by: 0
Authors
Fernandez, Santiago [1]
Graves, Alex [1]
Schmidhuber, Juergen [1, 2]
Affiliations
[1] IDSIA, Galleria 2, CH-6928 Manno Lugano, Switzerland
[2] Tech Univ Munich, D-85748 Garching, Munich, Germany
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The goal of keyword spotting is to detect the presence of specific spoken words in unconstrained speech. The majority of keyword spotting systems are based on generative hidden Markov models and lack discriminative capabilities. However, discriminative keyword spotting systems are currently based on frame-level posterior probabilities of sub-word units. This paper presents a discriminative keyword spotting system based solely on recurrent neural networks, which uses information from long time spans to estimate word-level posterior probabilities. In a keyword spotting task on a large database of unconstrained speech, the system achieved a keyword spotting accuracy of 84.5%.
Pages: 220 / +
Page count: 3