An application of recurrent neural networks to discriminative keyword spotting

被引:0
|
作者
Fernandez, Santiago [1 ]
Graves, Alex [1 ]
Schmidhuber, Juergen [1 ,2 ]
机构
[1] IDSIA, Galleria 2, CH-6928 Manno Lugano, Switzerland
[2] Tech Univ Munich, D-85748 Garching, Munich, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of keyword spotting is to detect the presence of specific spoken words in unconstrained speech. The majority of keyword spotting systems are based on generative hidden Markov models and lack discriminative capabilities. However, discriminative keyword spotting systems are currently based on frame-level posterior probabilities of sub-word units. This paper presents a discriminative keyword spotting system based on recurrent neural networks only, that uses information from long time spans to estimate word-level posterior probabilities. In a keyword spotting task on a large database of unconstrained speech the system achieved a keyword spotting accuracy of 84.5 %.
引用
收藏
页码:220 / +
页数:3
相关论文
共 50 条
  • [1] Discriminative keyword spotting
    Keshet, Joseph
    Grangier, David
    Bengio, Samy
    [J]. SPEECH COMMUNICATION, 2009, 51 (04) : 317 - 329
  • [2] Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting
    Arik, Sercan O.
    Kliegl, Markus
    Child, Rewon
    Hestness, Joel
    Gibiansky, Andrew
    Fougner, Chris
    Prenger, Ryan
    Coates, Adam
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1606 - 1610
  • [3] A RECURRENT NEURAL NETWORKS APPROACH FOR KEYWORD SPOTTING APPLIED ON ROMANIAN LANGUAGE
    Pipa, Sonia
    Boros, Tiberiu
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE 'LINQUISTIC RESOURCES AND TOOLS FOR PROCESSING THE ROMANIAN LANGUAGE', 2016, : 111 - 120
  • [4] Keyword spotting based on recurrent neural network
    Zhou, JL
    Liu, J
    Song, YT
    Yu, TC
    [J]. ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 710 - 713
  • [5] Deep Convolutional Spiking Neural Networks for Keyword Spotting
    Yilmaz, Emre
    Gevrek, Ozgur Bora
    Wu, Jibin
    Chen, Yuxiang
    Meng, Xuanbo
    Li, Haizhou
    [J]. INTERSPEECH 2020, 2020, : 2557 - 2561
  • [6] A survey on structured discriminative spoken keyword spotting
    Tabibian, Shima
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (04) : 2483 - 2520
  • [7] A survey on structured discriminative spoken keyword spotting
    Shima Tabibian
    [J]. Artificial Intelligence Review, 2020, 53 : 2483 - 2520
  • [8] Convolutional Neural Networks for Small-footprint Keyword Spotting
    Sainath, Tara N.
    Parada, Carolina
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1478 - 1482
  • [9] Combining Neural Networks to Improve Performance of Handwritten Keyword Spotting
    Frinken, Volkmar
    Fischer, Andreas
    Bunke, Horst
    [J]. MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2010, 5997 : 215 - 224
  • [10] Efficient keyword spotting using time delay neural networks
    Myer, Samuel
    Tomar, Vikrant Singh
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1264 - 1268