Small-Footprint Keyword Spotting for Controlling Smart Home Appliances Using TCN and CRNN Models

被引:0
|
作者
Alapati, Hemalatha [1 ]
Paolini, Christopher [2 ]
Chinara, Suchismita [1 ]
Sarkar, Mahasweta [2 ]
机构
[1] Natl Inst Technol, Rourkela, India
[2] San Diego State Univ, San Diego, CA 92182 USA
关键词
Augmentation; Bandwidth; Convolutional Recurrent Neural Networks; Keyword Spotting; Smart Home Assistants; Temporal Convolutional Networks; Unknown Words; Voice-Operated Devices;
D O I
10.4018/IJITN.299365
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Smart homes feature automatic fire/smoke detection, voice-operated assets and appliances, etc. More often, smart home appliances like lights, fans, etc. can be controlled through voice commands. Voice-operated devices like Alexa, Siri, and Google Assistant are not new in the current age concerning voice command execution. However, working with these supports requires a global connection with the internet that costs time and bandwidth. Controlling home appliances needs concise commands involving keywords on/off. Further, to operate the home appliances, bandwidth consumption for internet is not a wise idea. Through this paper, models based on temporal convolutional networks (TCN) and convolutional recurrent neural networks (CRNN) have been studied for keyword spotting (KWS) by training models with keywords pronounced in different accents. The performance of these models is compared, and their ability to detect unknown words is studied. Finally, how these models are suitable for building smart home assistants to control home utilities with minimum bandwidth consumption is discussed.
引用
收藏
页数:12
相关论文
共 46 条
  • [1] STREAMING SMALL-FOOTPRINT KEYWORD SPOTTING USING SEQUENCE-TO-SEQUENCE MODELS
    He, Yanzhang
    Prabhavalkar, Rohit
    Rao, Kanishka
    Li, Wei
    Bakhtin, Anton
    McGraw, Ian
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 474 - 481
  • [2] SMALL-FOOTPRINT KEYWORD SPOTTING USING DEEP NEURAL NETWORKS
    Chen, Guoguo
    Parada, Carolina
    Heigold, Georg
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [3] Convolutional Neural Networks for Small-footprint Keyword Spotting
    Sainath, Tara N.
    Parada, Carolina
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1478 - 1482
  • [4] EXPLORING REPRESENTATION LEARNING FOR SMALL-FOOTPRINT KEYWORD SPOTTING
    Cui, Fan
    Guo, Liyong
    Wang, Quandong
    Gao, Peng
    Wang, Yujun
    [J]. INTERSPEECH 2022, 2022, : 3258 - 3262
  • [5] Model compression applied to small-footprint keyword spotting
    Tucker, George
    Wu, Minhua
    Sun, Ming
    Panchapagesan, Sankaran
    Fu, Gengshen
    Vitaladevuni, Shiv
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1878 - 1882
  • [6] SMALL-FOOTPRINT KEYWORD SPOTTING WITH GRAPH CONVOLUTIONAL NETWORK
    Chen, Xi
    Yin, Shouyi
    Song, Dandan
    Ouyang, Peng
    Liu, Leibo
    Wei, Shaojun
    [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 539 - 546
  • [7] DEEP RESIDUAL LEARNING FOR SMALL-FOOTPRINT KEYWORD SPOTTING
    Tang, Raphael
    Lin, Jimmy
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5484 - 5488
  • [8] Region Proposal Network Based Small-Footprint Keyword Spotting
    Hou, Jingyong
    Shi, Yangyang
    Ostendorf, Mari
    Hwang, Mei-Yuh
    Xie, Lei
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (10) : 1471 - 1475
  • [9] IMPROVING RNN TRANSDUCER MODELING FOR SMALL-FOOTPRINT KEYWORD SPOTTING
    Tian, Yao
    Yao, Haitao
    Cai, Meng
    Liu, Yaming
    Ma, Zejun
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5624 - 5628
  • [10] Deep Template Matching for Small-footprint and Configurable Keyword Spotting
    Zhang, Peng
    Zhang, Xueliang
    [J]. INTERSPEECH 2020, 2020, : 2572 - 2576