Multitask Learning of Deep Neural Network-Based Keyword Spotting for IoT Devices

Cited by: 25
Authors
Leem, Seong-Gyun [1]
Yoo, In-Chul [1]
Yook, Dongsuk [1]
Affiliations
[1] Korea Univ, Dept Comp Sci & Engn, Artificial Intelligence Lab, Seoul 02841, South Korea
Funding
National Research Foundation of Singapore;
Keywords
Deep neural network; keyword spotting; multitask learning;
DOI
10.1109/TCE.2019.2899067
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract
Speech-based interfaces are convenient and intuitive and are therefore strongly preferred for human-computer interaction on Internet of Things (IoT) devices. Pre-defined keywords are typically used as triggers that notify a device to accept the subsequent voice commands. Keyword spotting techniques used as voice trigger mechanisms typically model the target keyword with triphone models and non-keywords with single-state filler models. Recently, deep neural networks (DNNs) have outperformed hidden Markov models with Gaussian mixture models in various tasks, including speech recognition. However, conventional DNN-based keyword spotting methods cannot easily change the target keywords, which is an essential feature for speech-based IoT device interfaces. Additionally, the increased computational requirements hinder the use of complex filler models in DNN-based keyword spotting systems, which reduces the accuracy of such systems. In this paper, we propose a novel DNN-based keyword spotting system that can change the keyword on the fly and uses both triphone and monophone acoustic models to reduce computational complexity and improve generalization performance. Experimental results on the FFMTIMIT corpus show that the proposed method reduced the error rate by 36.6%.
Pages: 188-194
Number of pages: 7