Constrained Ratio Mask for Speech Enhancement Using DNN

被引:4
|
作者
Yu, Hongjiang [1 ]
Zhu, Wei-Ping [1 ]
Yang, Yuhong [2 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
[2] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China
来源
基金
加拿大自然科学与工程研究理事会;
关键词
speech enhancement; constrained ratio mask; deep neural network; NOISE; SEPARATION; ALGORITHM;
D O I
10.21437/Interspeech.2020-1920
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Speech enhancement has found many applications concerning robust speech processing. A masking based algorithm, as an important method of speech enhancement, aims to retain the speech dominant components and suppress the noise dominant parts of the noisy speech. In this paper, we derive a new type of mask: constrained ratio mask (CRM), which can better control the trade-off between speech distortion and residual noise in the enhanced speech. A deep neural network (DNN) is then employed for CRM estimation in noisy conditions. The estimated CRM is finally applied to the noisy speech for denoising. Experimental results show that the enhanced speech from the new masking scheme yields an improved speech quality over three existing masks under various noisy conditions.
引用
收藏
页码:2427 / 2431
页数:5
相关论文
共 50 条
  • [31] Speech Enhancement Using RNN models and Ideal Exponential Mask
    Panda, Anirban
    PROCEEDINGS OF 2018 IEEE APPLIED SIGNAL PROCESSING CONFERENCE (ASPCON), 2018, : 178 - 182
  • [32] Harmonic Structure Mask for Speech Enhancement Using Sparsity Regularization
    Wang, Haonan
    Iwai, Kenta
    Nishiura, Takanobu
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 662 - 666
  • [33] DNN Based Speech Enhancement for Unseen Noises Using Monte Carlo Dropout
    Nazreen, P. M.
    Ramakrishnan, A. G.
    2018 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2018,
  • [34] An Integrated CNN-GRU Framework for Complex Ratio Mask Estimation in Speech Enhancement
    Hasannezhad, Mojtaba
    Ouyang, Zhiheng
    Zhu, Wei-Ping
    Champagne, Benoit
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 764 - 768
  • [35] Ideal neighbourhood mask for speech enhancement
    Arcos, C. D.
    Vellasco, M.
    Alcaim, A.
    ELECTRONICS LETTERS, 2018, 54 (05) : 317 - 318
  • [36] DNN-BASED ENHANCEMENT OF NOISY AND REVERBERANT SPEECH
    Zhao, Yan
    Wang, DeLiang
    Merks, Ivo
    Zhang, Tao
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6525 - 6529
  • [37] Significance of Phase in DNN based speech enhancement algorithms
    Rani, P. Swetha
    Andhavarapu, Sivaganesh
    Kodukula, Sri Rama Murry
    2020 TWENTY SIXTH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC 2020), 2020,
  • [38] ILMSAF based speech enhancement with DNN and noise classification
    Li, Ruwei
    Liu, Yanan
    Shi, Yongqiang
    Dong, Liang
    Cui, Weili
    SPEECH COMMUNICATION, 2016, 85 : 53 - 70
  • [39] Using Optimal Ratio Mask as Training Target for Supervised Speech Separation
    Xia, Shasha
    Li, Hao
    Zhang, Xueliang
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 163 - 166
  • [40] Ideal neighbourhood mask for speech enhancement using deep neural networks
    Arcos, Christian
    Vellasco, Marley
    Alcaim, Abraham
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,