An empirical study of cross-lingual transfer learning techniques for small-footprint keyword spotting

被引:7
|
作者
Sun, Ming [1 ]
Schwarz, Andreas [1 ]
Wu, Minhua [1 ]
Strom, Nikko [1 ]
Matsoukas, Spyros [1 ]
Vitaladevuni, Shiv [1 ]
机构
[1] Amazon Com, Alexa Machine Learning, Seattle, WA USA
关键词
transfer learning; keyword spotting; cross lingual; small-footprint; CANONICAL CORRELATION-ANALYSIS; SPEECH;
D O I
10.1109/ICMLA.2017.0-150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents our work on building a small footprint keyword spotting system for a resource-limited language, which requires low CPU, memory and latency. Our keyword spotting system consists of deep neural network (DNN) and hidden Markov model (HMM), which is a hybrid DNN-HMM decoder. We investigate different transfer learning techniques to leverage knowledge and data from a resource-abundant source language to improve the keyword DNN training for a target language which has limited in-domain data. The approaches employed in this paper include training a DNN using source language data to initialize the target language DNN training, mixing data from source and target languages together in a multi-task DNN training setup, using logits computed from a DNN trained on the source language data to regularize the keyword DNN training in the target language, as well as combinations of these techniques. Given different amounts of target language training data, our experimental results show that these transfer learning techniques successfully improve keyword spotting performance for the target language, measured by the area under the curve (AUC) of DNN-HMM decoding detection error tradeoff (DET) curves using a large in-house far-field test set.
引用
收藏
页码:255 / 260
页数:6
相关论文
共 50 条
  • [21] Domain Aware Training for Far-field Small-footprint Keyword Spotting
    Wu, Haiwei
    Jia, Yan
    Nie, Yuanfei
    Li, Ming
    [J]. INTERSPEECH 2020, 2020, : 2562 - 2566
  • [22] Joint Framework of Curriculum Learning and Knowledge Distillation for Noise-Robust and Small-Footprint Keyword Spotting
    Lim, Jaebong
    Baek, Yunju
    [J]. IEEE ACCESS, 2023, 11 : 100540 - 100553
  • [23] SMALL-FOOTPRINT KEYWORD SPOTTING ON RAW AUDIO DATA WITH SINC-CONVOLUTIONS
    Mittermaier, Simon
    Kuerzinger, Ludwig
    Waschneck, Bernd
    Rigoll, Gerhard
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7454 - 7458
  • [24] Small-footprint Spiking Neural Networks for Power-efficient Keyword Spotting
    Pedroni, Bruno U.
    Sheik, Sadique
    Mostafa, Hesham
    Paul, Somnath
    Augustine, Charles
    Cauwenberghs, Gert
    [J]. 2018 IEEE BIOMEDICAL CIRCUITS AND SYSTEMS CONFERENCE (BIOCAS): ADVANCED SYSTEMS FOR ENHANCING HUMAN HEALTH, 2018, : 591 - 594
  • [25] Error-Diffusion Based Speech Feature Quantization for Small-Footprint Keyword Spotting
    Luo, Mengjie
    Wang, Dingyi
    Wang, Xiaoqin
    Qiao, Shushan
    Zhou, Yumei
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1357 - 1361
  • [26] A Configurable Accelerator for Keyword Spotting Based on Small-Footprint Temporal Efficient Neural Network
    He, Keyan
    Chen, Dihu
    Su, Tao
    [J]. ELECTRONICS, 2022, 11 (16)
  • [27] Attention-based End-to-End Models for Small-Footprint Keyword Spotting
    Shan, Changhao
    Zhang, Junbo
    Wang, Yujun
    Xie, Lei
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2037 - 2041
  • [28] Improved Small-Footprint ASR-Based Solution for Open Vocabulary Keyword Spotting
    Pudo, Mikolaj
    Wosik, Mateusz
    Janicki, Artur
    [J]. IEEE ACCESS, 2024, 12 : 91289 - 91299
  • [29] Reduced Model Size Deep Convolutional Neural Networks for Small-Footprint Keyword Spotting
    Tsai, Tsung Han
    Lin, Xin Hui
    [J]. 2021 28TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS (IEEE ICECS 2021), 2021,
  • [30] VIRTUAL ADVERSARIAL TRAINING FOR DS-CNN BASED SMALL-FOOTPRINT KEYWORD SPOTTING
    Wang, Xiong
    Sun, Sining
    Xie, Lei
    [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 607 - 612