Constrained Ratio Mask for Speech Enhancement Using DNN

被引:4
|
作者
Yu, Hongjiang [1 ]
Zhu, Wei-Ping [1 ]
Yang, Yuhong [2 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China
来源
基金
加拿大自然科学与工程研究理事会;
关键词
speech enhancement; constrained ratio mask; deep neural network; NOISE; SEPARATION; ALGORITHM;
D O I
10.21437/Interspeech.2020-1920
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Speech enhancement has found many applications concerning robust speech processing. A masking based algorithm, as an important method of speech enhancement, aims to retain the speech dominant components and suppress the noise dominant parts of the noisy speech. In this paper, we derive a new type of mask: constrained ratio mask (CRM), which can better control the trade-off between speech distortion and residual noise in the enhanced speech. A deep neural network (DNN) is then employed for CRM estimation in noisy conditions. The estimated CRM is finally applied to the noisy speech for denoising. Experimental results show that the enhanced speech from the new masking scheme yields an improved speech quality over three existing masks under various noisy conditions.
引用
收藏
页码:2427 / 2431
页数:5
相关论文
共 50 条
  • [1] Ideal ratio mask estimation using supervised DNN approach for target speech signal enhancement
    Selvaraj, Poovarasan
    Chandra, E.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (03) : 1869 - 1883
  • [2] DNN Classification Model-based Speech Enhancement Using Mask Selection Technique
    Lee, Bong-Ki
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 436 - 440
  • [3] Towards More Efficient DNN-Based Speech Enhancement Using Quantized Correlation Mask
    Abdullah, Salinna
    Zamani, Majid
    Demosthenous, Andreas
    IEEE ACCESS, 2021, 9 : 24350 - 24362
  • [4] Speech Enhancement Based on a Joint Two-Stage CRN plus DNN-DEC Model and a New Constrained Phase-Sensitive Magnitude Ratio Mask
    Pashaian, Matin
    Seyedin, Sanaz
    IEEE ACCESS, 2024, 12 : 98567 - 98583
  • [5] DNN-Based Feature Enhancement Using DOA-Constrained ICA for Robust Speech Recognition
    Lee, Ho-Yong
    Cho, Ji-Won
    Kim, Minook
    Park, Hyung-Min
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (08) : 1091 - 1095
  • [6] DNN-BASED DISTRIBUTED MULTICHANNEL MASK ESTIMATION FOR SPEECH ENHANCEMENT IN MICROPHONE ARRAYS
    Furnon, Nicolas
    Serizel, Romain
    Illina, Irina
    Essid, Slim
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4672 - 4676
  • [7] Improvement of Mask-Based Speech Source Separation Using DNN
    Zhan, Ge
    Huang, Zhaoqiong
    Ying, Dongwen
    Pan, Jielin
    Yan, Yonghong
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [8] Experimental Study on Speech Enhancement using DNN with perceptual Weighting
    Shi, Wenhua
    Zhang, Xiongwei
    Zou, Xia
    Sun, Meng
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING (ICCIP 2018), 2018, : 309 - 312
  • [9] DNN-BASED SPEECH ENHANCEMENT USING MBE MODEL
    Huang, Qizheng
    Bao, Changchun
    Wang, Xianyun
    Xiang, Yang
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 196 - 200
  • [10] Power Exponent Based Weighting Criterion for DNN-Based Mask Approximation in Speech Enhancement
    Cui, Zihao
    Bao, Changchun
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 618 - 622