DEMUCS-Mobile : On-device lightweight speech enhancement

被引:1
|
作者
Lee, Lukas [1 ]
Ji, Youna [1 ]
Lee, Minjae [1 ]
Choi, Min-Seok [1 ]
机构
[1] Naver Coporat, Gyeoggi, South Korea
来源
关键词
speech enhancement; on-device; mobile; channel; pruning; model compression;
D O I
10.21437/Interspeech.2021-1025
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
As the importance of speech enhancement for real-world application increases, the compactness of the model is also becoming a crucial study. In this paper, we present compression techniques to reduce the model size and applied them to the state-of-the-art real-time speech enhancement system. We successfully reduce the model size by actively applying channel pruning while maintaining performance. In particular, we propose a method to prune more channels of convolutional neural networks (CNN) by utilizing gated linear unit (GLU) activation. In addition, lower-bit-quantization is applied to reduce model size, while minimizing performance degradation caused by quantization. We show the performance of our proposed model on a mobile device where computing resources are limited. In particular, it is implemented to enable streaming, and speech enhancement works in real-time.
引用
收藏
页码:2711 / 2715
页数:5
相关论文
共 50 条
  • [1] ON-DEVICE NEURAL SPEECH SYNTHESIS
    Achanta, Sivanand
    Antony, Albert
    Golipour, Ladan
    Li, Jiangchuan
    Raitio, Tuomo
    Rasipuram, Ramya
    Rossi, Francesco
    Shi, Jennifer
    Upadhyay, Jaimin
    Winarsky, David
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 1155 - 1161
  • [2] Target Selection Strategies for Demucs-Based Speech Enhancement
    Rascon, Caleb
    Fuentes-Pineda, Gibran
    APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [3] Lightweight Approximation of Softmax Layer for On-Device Inference
    Vasyltsov, Ihor
    Chang, Wooseok
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND APPLIED COGNITIVE COMPUTING, 2021, : 561 - 570
  • [4] A Lightweight On-Device Detection Method for Android Malware
    Yuan, Wei
    Jiang, Yuan
    Li, Heng
    Cai, Minghui
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (09): : 5600 - 5611
  • [5] Garbage Modeling for On-device Speech Recognition
    Van Gysel, Christophe
    Velikovich, Leonid
    McGraw, Ian
    Beaufays, Francoise
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2127 - 2131
  • [6] PUNCTUATION PREDICTION FOR STREAMING ON-DEVICE SPEECH RECOGNITION
    Zhou, Zhikai
    Tan, Tian
    Qian, Yanmin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7277 - 7281
  • [7] ATTENTION BASED ON-DEVICE STREAMING SPEECH RECOGNITION WITH LARGE SPEECH CORPUS
    Kim, Kwangyoun
    Lee, Kyungmin
    Gowda, Dhananjaya
    Park, Junmo
    Kim, Sungsoo
    Jin, Sichen
    Lee, Young-Yoon
    Yeo, Jinsu
    Kim, Daehyun
    Jung, Seokyeong
    Lee, Jungin
    Han, Myoungji
    Kim, Chanwoo
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 956 - 963
  • [8] Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
    Li, Wei
    Qin, James
    Chiu, Chung-Cheng
    Pang, Ruoming
    He, Yanzhang
    INTERSPEECH 2020, 2020, : 2122 - 2126
  • [9] Fast and lightweight on-device TTS with Tacotron2 and LPCNet
    Popov, Vadim
    Kamenev, Stanislav
    Kudinov, Mikhail
    Repyevsky, Sergey
    Sadekova, Tasnima
    Bushaev, Vitalii
    Kryzhanovskiy, Vladimir
    Parkhomenko, Denis
    INTERSPEECH 2020, 2020, : 220 - 224
  • [10] Robust Continuous On-device Personalization for Automatic Speech Recognition
    Sim, Khe Chai
    Chandorkar, Angad
    Gao, Fan
    Chua, Mason
    Munkhdalai, Tsendsuren
    Beaufays, Francoise
    INTERSPEECH 2021, 2021, : 1284 - 1288