Efficient keyword spotting using time delay neural networks

Cited by: 0
Authors
Myer, Samuel [1]
Tomar, Vikrant Singh [1]
Affiliations
[1] Fluent Ai Inc, Montreal, PQ, Canada
Keywords
keyword spotting; wake word; time-delay neural network; transfer learning
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes a novel method of live keyword spotting using a two-stage time delay neural network. The model is trained using transfer learning: initial training with phone targets from a large speech corpus is followed by training with keyword targets from a smaller data set. The accuracy of the system is evaluated on two separate tasks. The first is the freely available Google Speech Commands dataset. The second is an in-house task specifically developed for keyword spotting. The results show significant improvements in false accept and false reject rates in both clean and noisy environments when compared with previously known techniques. Furthermore, we investigate various techniques to reduce computation in terms of multiplications per second of audio. Compared to recently published work, the proposed system provides up to 89% savings on computational complexity.
Pages: 1264-1268
Number of pages: 5