AUTOKWS: KEYWORD SPOTTING WITH DIFFERENTIABLE ARCHITECTURE SEARCH

被引:9
|
作者
Zhang, Bo [1 ]
Li, Wenfeng [1 ]
Li, Qingyuan [1 ]
Zhuang, Weiji [1 ]
Chu, Xiangxiang [1 ]
Wang, Yujun [1 ]
机构
[1] Xiaomi AI Lab, Beijing, Peoples R China
关键词
Keyword spotting; neural architecture search;
D O I
10.1109/ICASSP39728.2021.9414848
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Smart audio devices are gated by an always-on lightweight keyword spotting program to reduce power consumption. It is however challenging to design models that have both high accuracy and low latency for accurate and fast responsiveness. Many efforts have been made to develop end-to-end neural networks, in which depthwise separable convolutions, temporal convolutions, and LSTMs are adopted as building units. Nonetheless, these networks designed with human expertise may not achieve an optimal trade-off in an expansive search space. In this paper, we propose to leverage recent advances in differentiable neural architecture search to discover more efficient networks. Our searched model attains 97.2% top-1 accuracy on Google Speech Command Dataset v1 with only nearly 100K parameters.
引用
收藏
页码:2830 / 2834
页数:5
相关论文
共 50 条
  • [31] Graph Differentiable Architecture Search with Structure Learning
    Qin, Yijian
    Wang, Xin
    Zhang, Zeyang
    Zhu, Wenwu
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [32] Layered feature representation for differentiable architecture search
    Hao, Jie
    Zhu, William
    [J]. SOFT COMPUTING, 2022, 26 (10) : 4741 - 4753
  • [33] DyConvMixer: Dynamic Convolution Mixer Architecture for Open-Vocabulary Keyword Spotting
    Gharbieh, Waseem
    Huang, Jinmiao
    Wan, Qianhui
    Shim, Han Suk
    Lee, Chul
    [J]. INTERSPEECH 2022, 2022, : 5205 - 5209
  • [34] A two-step keyword spotting method using fuzzy search algorithm
    Zhang Pengyuan
    Shao Jian
    Yan Yonghong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2008, 17 (01) : 33 - 36
  • [35] Discriminative keyword spotting using triphones information and N-best search
    Tabibian, Shima
    Akbari, Ahmad
    Nasersharif, Babak
    [J]. INFORMATION SCIENCES, 2018, 423 : 157 - 171
  • [36] METRIC LEARNING FOR KEYWORD SPOTTING
    Huh, Jaesung
    Lee, Minjae
    Heo, Heesoo
    Mun, Seongkyu
    Chung, Joon Son
    [J]. 2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 133 - 140
  • [37] FEDERATED LEARNING FOR KEYWORD SPOTTING
    Leroy, David
    Coucke, Alice
    Lavril, Thibaut
    Gisselbrecht, Thibault
    Dureau, Joseph
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6341 - 6345
  • [38] Analog LSTM for Keyword Spotting
    Odame, Kofi
    Nyamukuru, Maria
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2022): INTELLIGENT TECHNOLOGY IN THE POST-PANDEMIC ERA, 2022, : 375 - 378
  • [39] Latency Control for Keyword Spotting
    Jose, Christin
    Wang, Joseph
    Strimel, Grant P.
    Khursheed, Mohammad Omar
    Mishchenko, Yuriy
    Kulis, Brian
    [J]. INTERSPEECH 2022, 2022, : 1891 - 1895
  • [40] A New Keyword Spotting Approach
    Bahi, Halima
    Benati, Nadia
    [J]. 2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS 2009), 2009, : 77 - +