Increasing Compactness of Deep Learning Based Speech Enhancement Models With Parameter Pruning and Quantization Techniques

被引:16
|
作者
Wu, Jyun-Yi [1 ]
Yu, Cheng [1 ]
Fu, Szu-Wei [2 ]
Liu, Chih-Ting [1 ]
Chien, Shao-Yi [1 ]
Tsao, Yu [2 ]
机构
[1] Natl Taiwan Univ, Grad Inst Environm Engn, Taipei 10673, Taiwan
[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115, Taiwan
关键词
Compactness; Parameter Pruning; Parameter Quantization; Low Computational Cost; SUPPRESSION; NOISE;
D O I
10.1109/LSP.2019.2951950
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The most recent studies on deep learning based speech enhancement (SE) are focused on improving denoising performance. However, successful SE applications require striking a desirable balance between the denoising performance and computational cost in real scenarios. In this study, we propose a novel parameter pruning (PP) technique, which removes redundant channels in a neural network. In addition, parameter quantization (PQ) and feature-map quantization (FQ) techniques were also integrated to generate even more compact SE models. The experimental results show that the integration of PP, PQ, and FQ can produce a compacted SE model with a size of only 9.76 % compared to that of the original model, resulting in minor performance losses of 0.01 (from 0.85 to 0.84) and 0.03 (from 2.55 to 2.52) for STOI and PESQ scores, respectively. These promising results confirm that the PP, PQ, and FQ techniques can be used to effectively reduce the storage of an SE system on edge devices.
引用
收藏
页码:1887 / 1891
页数:5
相关论文
共 50 条
  • [1] Optimizing energy consumption in deep learning models using pruning and quantization techniques
    Al-Alshaikh, Halah A.
    [J]. JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2024, 45 (05): : 1453 - 1463
  • [2] An Overview of Speech Enhancement Based on Deep Learning Techniques
    Jannu, Chaitanya
    Vanambathina, Sunny Dayal
    [J]. INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2023,
  • [3] Speech Enhancement: Traditional and Deep Learning Techniques
    Gaddamedi, Satya Prasad
    Patel, Anuj
    Chandra, Sabyasachi
    Bharati, Puja
    Ghosh, Nirmalya
    Das Mandal, Shyamal Kumar
    [J]. PROCEEDINGS OF 27TH INTERNATIONAL SYMPOSIUM ON FRONTIERS OF RESEARCH IN SPEECH AND MUSIC, FRSM 2023, 2024, 1455 : 75 - 86
  • [4] Explaining deep learning models for speech enhancement
    Sivasankaran, Sunit
    Vincent, Emmanuel
    Fohr, Dominique
    [J]. INTERSPEECH 2021, 2021, : 696 - 700
  • [5] On the Robustness of Deep Learning-Based Speech Enhancement
    Chhetri, Amit S.
    Hilmes, Philip
    Athi, Mrudula
    Shankar, Nikhil
    [J]. 2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1587 - 1594
  • [6] Voice disorder classification using speech enhancement and deep learning models
    Chaiani, Mounira
    Selouani, Sid Ahmed
    Boudraa, Malika
    Yakoub, Mohammed Sidi
    [J]. BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2022, 42 (02) : 463 - 480
  • [7] Speech enhancement with noise estimation and filtration using deep learning models
    Kantamaneni, Sravanthi
    Charles, A.
    Babu, T. Ranga
    [J]. THEORETICAL COMPUTER SCIENCE, 2023, 941 : 14 - 28
  • [8] Deep Learning Models for Single-Channel Speech Enhancement on Drones
    Mukhutdinov, Dmitrii
    Alex, Ashish
    Cavallaro, Andrea
    Wang, Lin
    [J]. IEEE ACCESS, 2023, 11 : 22993 - 23007
  • [9] Speech enhancement with noise estimation and filtration using deep learning models
    Kantamaneni, Sravanthi
    Charles, A.
    Babu, T. Ranga
    [J]. THEORETICAL COMPUTER SCIENCE, 2023, 941 : 14 - 28
  • [10] Hardware-oriented pruning and quantization of Deep Learning models to detect life-threatening arrhythmias
    Gonzalez-Carabarin, Lizeth
    Schmid, Alexandre
    Van Sloun, Ruud J. G.
    [J]. 2021 IEEE BIOMEDICAL CIRCUITS AND SYSTEMS CONFERENCE (IEEE BIOCAS 2021), 2021,