Efficient keyword spotting using time delay neural networks

Cited by: 0
Authors
Myer, Samuel [1]
Tomar, Vikrant Singh [1]
Affiliations
[1] Fluent Ai Inc, Montreal, PQ, Canada
Keywords
keyword spotting; wake word; time-delay neural network; transfer learning
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes a novel method of live keyword spotting using a two-stage time delay neural network. The model is trained using transfer learning: initial training with phone targets from a large speech corpus is followed by training with keyword targets from a smaller data set. The accuracy of the system is evaluated on two separate tasks. The first is the freely available Google Speech Commands dataset. The second is an in-house task specifically developed for keyword spotting. The results show significant improvements in false accept and false reject rates in both clean and noisy environments when compared with previously known techniques. Furthermore, we investigate various techniques to reduce computation in terms of multiplications per second of audio. Compared to recently published work, the proposed system provides up to 89% savings on computational complexity.
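The abstract measures computation in multiplications per second of audio. As a rough illustration of how such a count can be made for a time delay network, the sketch below treats each layer as a dense transform over a window of spliced input frames. It is not the authors' architecture: the layer sizes, the frame rate, and the names `TdnnLayer` and `mults_per_second` are hypothetical and chosen only for the example.

```python
from dataclasses import dataclass


@dataclass
class TdnnLayer:
    """One time delay layer (hypothetical sizes, for illustration only)."""
    in_dim: int      # feature dimension of each input frame
    out_dim: int     # feature dimension of each output frame
    context: int     # number of input frames spliced per output frame
    stride: int = 1  # temporal subsampling relative to the layer's input


def mults_per_second(layers, input_frames_per_second):
    """Count multiplications per second of audio for a stack of layers.

    Each output frame costs context * in_dim * out_dim multiplications,
    and a layer with stride s emits 1/s as many frames as it receives.
    Bias additions and nonlinearities are ignored.
    """
    total = 0.0
    rate = float(input_frames_per_second)
    for layer in layers:
        rate /= layer.stride  # output frames per second for this layer
        total += rate * layer.context * layer.in_dim * layer.out_dim
    return total


# Hypothetical two-layer stack fed with 100 feature frames per second.
example_layers = [
    TdnnLayer(in_dim=40, out_dim=64, context=3),
    TdnnLayer(in_dim=64, out_dim=64, context=3, stride=2),
]
example_cost = mults_per_second(example_layers, 100)
```

Under this counting, subsampling in an early layer lowers the cost of every layer after it, which is one general way a time delay network can reduce computation. Whether the paper's reported savings come from this or from other techniques is not stated in the abstract.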
Pages: 1264-1268
Page count: 5
Related papers
50 in total
  • [21] Neural Architecture Search For Keyword Spotting
    Mo, Tong
    Yu, Yakun
    Salameh, Mohammad
    Niu, Di
    Jui, Shangling
    [J]. INTERSPEECH 2020, 2020 : 1982 - 1986
  • [22] Keyword spotting in continuous speech using convolutional neural network
    Rostami, Amir Mohammad
    Karimi, Ali
    Akhaee, Mohammad Ali
    [J]. Speech Communication, 2022, 142 : 15 - 21
  • [23] Keyword spotting in continuous speech using convolutional neural network
    Rostami, Amir Mohammad
    Karimi, Ali
    Akhaee, Mohammad Ali
    [J]. SPEECH COMMUNICATION, 2022, 142 : 15 - 21
  • [24] Robust and efficient keyword spotting using a bidirectional attention LSTM
    Swain O.P.
    Hemanth H.
    Saran P.
    Kothandaraman M.
    Ravi L.
    Sailor H.
    Rajesh K.S.
    [J]. International Journal of Speech Technology, 2023, 26 (04) : 919 - 931
  • [25] Efficient Keyword Spotting System Using Deformable Convolutional Network
    Nguyen, Huu Binh
    Duong, Van Hai
    Tran Thi, Anh Xuan
    Nguyen, Quoc Cuong
    [J]. IETE JOURNAL OF RESEARCH, 2023, 69 (07) : 4196 - 4204
  • [26] Keyword Spotting with Convolutional Deep Belief Networks and Dynamic Time Warping
    Wicht, Baptiste
    Fischer, Andreas
    Hennebert, Jean
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 113 - 120
  • [27] Time-Delay-Neural-Network-Based Audio Feature Extractor for Ultra-Low Power Keyword Spotting
    Fuketa, Hiroshi
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (02) : 334 - 338
  • [28] Context Dependent Acoustic Keyword Spotting Using Deep Neural Network
    Wang, Guangsen
    Sim, Khe Chai
    [J]. 2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013
  • [29] Keyword spotting based on recurrent neural network
    Zhou, JL
    Liu, J
    Song, YT
    Yu, TC
    [J]. ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998 : 710 - 713
  • [30] Efficient training of Time Delay Neural Networks for sequential patterns
    Cancelliere, R
    Gemello, R
    [J]. NEUROCOMPUTING, 1996, 10 (01) : 33 - 42