DYNAMIC SPARSITY NEURAL NETWORKS FOR AUTOMATIC SPEECH RECOGNITION

被引:20
|
作者
Wu, Zhaofeng [1 ,2 ]
Zhao, Ding [2 ]
Liang, Qiao [2 ]
Yu, Jiahui [2 ]
Gulati, Anmol [2 ]
Pang, Ruoming [2 ]
机构
[1] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
[2] Google, Mountain View, CA 94043 USA
关键词
ASR; Model Pruning; Dynamic Sparse Models;
D O I
10.1109/ICASSP39728.2021.9414505
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In automatic speech recognition (ASR), model pruning is a widely adopted technique that reduces model size and latency to deploy neural network models on edge devices with resource constraints. However, multiple models with different sparsity levels usually need to be separately trained and deployed to heterogeneous target hardware with different resource specifications and for applications that have various latency requirements. In this paper, we present Dynamic Sparsity Neural Networks (DSNN) that, once trained, can instantly switch to any predefined sparsity configuration at run-time. We demonstrate the effectiveness and flexibility of DSNN using experiments on internal production datasets with Google Voice Search data, and show that the performance of a DSNN model is on par with that of individually trained single sparsity networks. Our trained DSNN model, therefore, can greatly ease the training process and simplify deployment in diverse scenarios with resource constraints.
引用
收藏
页码:6014 / 6018
页数:5
相关论文
共 50 条
  • [1] DYNAMIC NEURAL NETWORKS FOR SPEECH RECOGNITION
    OLAFSSON, S
    [J]. BT TECHNOLOGY JOURNAL, 1992, 10 (03): : 48 - 58
  • [2] Automatic Speech Recognition Based on Neural Networks
    Schlueter, Ralf
    Doetsch, Patrick
    Golik, Pavel
    Kitza, Markus
    Menne, Tobias
    Irie, Kazuki
    Tueske, Zoltan
    Zeyer, Albert
    [J]. SPEECH AND COMPUTER, 2016, 9811 : 3 - 17
  • [3] Dynamic Bayesian networks for automatic speech recognition
    Deviren, M
    [J]. EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 981 - 981
  • [4] Automatic Speech Recognition with Deep Neural Networks for Impaired Speech
    Espana-Bonet, Cristina
    Fonollosa, Jose A. R.
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2016, 2016, 10077 : 97 - 107
  • [5] A comprehensive survey on automatic speech recognition using neural networks
    Amandeep Singh Dhanjal
    Williamjeet Singh
    [J]. Multimedia Tools and Applications, 2024, 83 : 23367 - 23412
  • [6] A comprehensive survey on automatic speech recognition using neural networks
    Dhanjal, Amandeep Singh
    Singh, Williamjeet
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 23367 - 23412
  • [7] Automatic Recognition of Kazakh Speech Using Deep Neural Networks
    Mamyrbayev, Orken
    Turdalyuly, Mussa
    Mekebayev, Nurbapa
    Alimhan, Keylan
    Kydyrbekova, Aizat
    Turdalykyzy, Tolganay
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT II, 2019, 11432 : 465 - 474
  • [8] Fast speaker adaptation of artificial neural networks for automatic speech recognition
    Dupont, S
    Cheboub, L
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1795 - 1798
  • [9] Speech recognition using dynamic programming of Bayesian neural networks
    Huang, CC
    Wang, JF
    Wu, CH
    Lee, JY
    [J]. CENTRAL AUDITORY PROCESSING AND NEURAL MODELING, 1998, : 71 - 76
  • [10] Automatic Naturalness Recognition from Acted Speech Using Neural Networks
    Atmaja, Bagus Tris
    Sasou, Akira
    Akagi, Masato
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 731 - 736