Constrained Ratio Mask for Speech Enhancement Using DNN

被引:4
|
作者
Yu, Hongjiang [1 ]
Zhu, Wei-Ping [1 ]
Yang, Yuhong [2 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China
来源
基金
加拿大自然科学与工程研究理事会;
关键词
speech enhancement; constrained ratio mask; deep neural network; NOISE; SEPARATION; ALGORITHM;
D O I
10.21437/Interspeech.2020-1920
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Speech enhancement has found many applications concerning robust speech processing. A masking based algorithm, as an important method of speech enhancement, aims to retain the speech dominant components and suppress the noise dominant parts of the noisy speech. In this paper, we derive a new type of mask: constrained ratio mask (CRM), which can better control the trade-off between speech distortion and residual noise in the enhanced speech. A deep neural network (DNN) is then employed for CRM estimation in noisy conditions. The estimated CRM is finally applied to the noisy speech for denoising. Experimental results show that the enhanced speech from the new masking scheme yields an improved speech quality over three existing masks under various noisy conditions.
引用
收藏
页码:2427 / 2431
页数:5
相关论文
共 50 条
  • [21] Codebook-driven speech enhancement using DNN and harmonic emphasis
    Yang, Yan
    Bao, Changchun
    Wang, Xianyun
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 149 - 154
  • [22] Monaural speech enhancement combining accurate ratio mask and deep neural network
    BAI Haojun
    ZHANG Tianqi
    LIU Jianxing
    YE Shaopeng
    Chinese Journal of Acoustics, 2022, 41 (04) : 373 - 389
  • [23] Joint Ideal Ratio Mask and Generative Adversarial Networks for Monaural Speech Enhancement
    Yuan, Jing
    Bao, Changchun
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 276 - 280
  • [24] Integration of DNN based Speech Enhancement and ASR
    Astudillo, Ramon F.
    Correia, Joana
    Trancoso, Isabel
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3576 - 3580
  • [25] USING SEPARATE LOSSES FOR SPEECH AND NOISE IN MASK-BASED SPEECH ENHANCEMENT
    Xu, Ziyi
    Elshamy, Samy
    Fingscheidt, Tim
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7519 - 7523
  • [26] Speech enhancement using a constrained iterative sinusoidal model
    Jensen, J
    Hansen, JHL
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (07): : 731 - 740
  • [27] Constrained Iterative Speech Enhancement Using Phonetic Classes
    Das, Amit
    Hansen, John H. L.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06): : 1869 - 1883
  • [28] Speech enhancement using the constrained-optimization technique
    Li, W
    Siu, WC
    IEEE SIGNAL PROCESSING LETTERS, 2000, 7 (02) : 28 - 30
  • [29] JOINT NOISE AND MASK AWARE TRAINING FOR DNN-BASED SPEECH ENHANCEMENT WITH SUB-BAND FEATURES
    Wang, Qing
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    2017 HANDS-FREE SPEECH COMMUNICATIONS AND MICROPHONE ARRAYS (HSCMA 2017), 2017, : 101 - 105
  • [30] DNN-BASED SPEECH MASK ESTIMATION FOR EIGENVECTOR BEAMFORMING
    Pfeifenberger, Lukas
    Zoehrer, Matthias
    Pernkopf, Franz
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 66 - 70