DYNAMIC SPARSITY NEURAL NETWORKS FOR AUTOMATIC SPEECH RECOGNITION

Cited by: 20

Authors:

Wu, Zhaofeng ^{[1,2]}

Zhao, Ding ^{[2]}

Liang, Qiao ^{[2]}

Yu, Jiahui ^{[2]}

Gulati, Anmol ^{[2]}

Pang, Ruoming ^{[2]}

Affiliations:

[1] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA

[2] Google, Mountain View, CA 94043 USA

Source:

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021

Keywords:

ASR; Model Pruning; Dynamic Sparse Models;

DOI:

10.1109/ICASSP39728.2021.9414505

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

In automatic speech recognition (ASR), model pruning is a widely adopted technique that reduces model size and latency to deploy neural network models on edge devices with resource constraints. However, multiple models with different sparsity levels usually need to be separately trained and deployed to heterogeneous target hardware with different resource specifications and for applications that have various latency requirements. In this paper, we present Dynamic Sparsity Neural Networks (DSNN) that, once trained, can instantly switch to any predefined sparsity configuration at run-time. We demonstrate the effectiveness and flexibility of DSNN using experiments on internal production datasets with Google Voice Search data, and show that the performance of a DSNN model is on par with that of individually trained single sparsity networks. Our trained DSNN model, therefore, can greatly ease the training process and simplify deployment in diverse scenarios with resource constraints.
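The run-time switching described in the abstract can be illustrated with a toy example. This is a minimal sketch under stated assumptions, not the authors' implementation: it assumes a single shared weight vector and one precomputed magnitude-pruning mask per predefined sparsity level, and it does not cover how DSNN is trained. The names `magnitude_mask` and `SwitchableSparseLayer` are hypothetical and exist only for illustration.

```python
# Hypothetical illustration only: one shared weight list plus one
# precomputed magnitude-pruning mask per predefined sparsity level.
# The record does not state that DSNN is implemented this way.

def magnitude_mask(weights, sparsity):
    """Return a 0/1 mask that zeroes the smallest-magnitude fraction of weights."""
    n_prune = int(round(sparsity * len(weights)))
    # Indices in ascending order of absolute value; the first n_prune are pruned.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = set(order[:n_prune])
    return [0 if i in pruned else 1 for i in range(len(weights))]


class SwitchableSparseLayer:
    """A single linear unit whose sparsity level can be changed without retraining."""

    def __init__(self, weights, sparsity_levels):
        self.weights = list(weights)
        # All masks are built once, so switching is just a dictionary lookup.
        self.masks = {s: magnitude_mask(self.weights, s) for s in sparsity_levels}
        self.active = min(sparsity_levels)

    def set_sparsity(self, sparsity):
        if sparsity not in self.masks:
            raise ValueError(f"sparsity {sparsity} is not a predefined level")
        self.active = sparsity

    def forward(self, inputs):
        mask = self.masks[self.active]
        return sum(w * m * x for w, m, x in zip(self.weights, mask, inputs))


layer = SwitchableSparseLayer([0.9, -0.1, 0.4, -0.7], sparsity_levels=[0.0, 0.5, 0.75])
x = [1.0, 1.0, 1.0, 1.0]
dense_out = layer.forward(x)   # all four weights active
layer.set_sparsity(0.5)
half_out = layer.forward(x)    # the two smallest-magnitude weights are masked out
```

Because every mask is derived from the same weights, only one set of parameters has to be stored and deployed; choosing a level at run time selects a different mask rather than a different model.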

Pages: 6014-6018

Page count: 5

50 items in total

[1] DYNAMIC NEURAL NETWORKS FOR SPEECH RECOGNITION
OLAFSSON, S
[J]. BT TECHNOLOGY JOURNAL, 1992, 10 (03): : 48 - 58
[2] Automatic Speech Recognition Based on Neural Networks
Schlueter, Ralf
Doetsch, Patrick
Golik, Pavel
Kitza, Markus
Menne, Tobias
Irie, Kazuki
Tueske, Zoltan
Zeyer, Albert
[J]. SPEECH AND COMPUTER, 2016, 9811 : 3 - 17
[3] Dynamic Bayesian networks for automatic speech recognition
Deviren, M
[J]. EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 981 - 981
[4] Automatic Speech Recognition with Deep Neural Networks for Impaired Speech
Espana-Bonet, Cristina
Fonollosa, Jose A. R.
[J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2016, 2016, 10077 : 97 - 107
[5] A comprehensive survey on automatic speech recognition using neural networks
Amandeep Singh Dhanjal
Williamjeet Singh
[J]. Multimedia Tools and Applications, 2024, 83 : 23367 - 23412
[6] A comprehensive survey on automatic speech recognition using neural networks
Dhanjal, Amandeep Singh
Singh, Williamjeet
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 23367 - 23412
[7] Automatic Recognition of Kazakh Speech Using Deep Neural Networks
Mamyrbayev, Orken
Turdalyuly, Mussa
Mekebayev, Nurbapa
Alimhan, Keylan
Kydyrbekova, Aizat
Turdalykyzy, Tolganay
[J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT II, 2019, 11432 : 465 - 474
[8] Fast speaker adaptation of artificial neural networks for automatic speech recognition
Dupont, S
Cheboub, L
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1795 - 1798
[9] Speech recognition using dynamic programming of Bayesian neural networks
Huang, CC
Wang, JF
Wu, CH
Lee, JY
[J]. CENTRAL AUDITORY PROCESSING AND NEURAL MODELING, 1998, : 71 - 76
[10] Automatic Naturalness Recognition from Acted Speech Using Neural Networks
Atmaja, Bagus Tris
Sasou, Akira
Akagi, Masato
[J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 731 - 736