Constrained Ratio Mask for Speech Enhancement Using DNN

Cited by: 4

Authors:

Yu, Hongjiang [1]

Zhu, Wei-Ping [1]

Yang, Yuhong [2]

Affiliations:

[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada

[2] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China

Source:

INTERSPEECH 2020 | 2020

Funding:

Natural Sciences and Engineering Research Council of Canada (NSERC)

Keywords:

speech enhancement; constrained ratio mask; deep neural network; NOISE; SEPARATION; ALGORITHM;

DOI:

10.21437/Interspeech.2020-1920

Chinese Library Classification (CLC) numbers:

R36 [Pathology]; R76 [Otorhinolaryngology]

Discipline classification codes:

100104; 100213

Abstract:

Speech enhancement has found many applications in robust speech processing. Masking-based algorithms, an important class of speech enhancement methods, aim to retain the speech-dominant components and suppress the noise-dominant parts of the noisy speech. In this paper, we derive a new type of mask, the constrained ratio mask (CRM), which can better control the trade-off between speech distortion and residual noise in the enhanced speech. A deep neural network (DNN) is then employed for CRM estimation in noisy conditions. The estimated CRM is finally applied to the noisy speech for denoising. Experimental results show that the enhanced speech from the new masking scheme yields improved speech quality over three existing masks under various noisy conditions.
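
The general masking idea in the abstract can be illustrated with a minimal sketch. This record does not give the CRM definition, so the example below swaps in the widely used ideal ratio mask (IRM) and should not be read as the authors' method. The function names, the toy magnitudes, and the additive-magnitude assumption for the noisy signal are all hypothetical and chosen only for illustration. In a DNN-based system the mask would be estimated from noisy features; here it is computed from known speech and noise magnitudes.

```python
import math

def ideal_ratio_mask(speech_mag, noise_mag, eps=1e-12):
    """Per-bin IRM, sqrt(S^2 / (S^2 + N^2)); values lie in [0, 1].

    This is the standard IRM, used here as a stand-in for the CRM,
    whose definition is not available in this record.
    """
    return [
        [math.sqrt(s * s / (s * s + n * n + eps)) for s, n in zip(s_row, n_row)]
        for s_row, n_row in zip(speech_mag, noise_mag)
    ]

def apply_mask(mask, noisy_mag):
    """Scale each time-frequency bin of the noisy magnitude by its mask value."""
    return [
        [m * y for m, y in zip(m_row, y_row)]
        for m_row, y_row in zip(mask, noisy_mag)
    ]

# Toy magnitude spectrograms: 2 frames x 3 frequency bins (made-up numbers).
speech = [[3.0, 0.0, 1.0], [4.0, 2.0, 0.0]]
noise = [[4.0, 1.0, 1.0], [3.0, 0.0, 2.0]]

# Simplifying assumption: noisy magnitude is the sum of the two magnitudes.
noisy = [[s + n for s, n in zip(s_row, n_row)]
         for s_row, n_row in zip(speech, noise)]

mask = ideal_ratio_mask(speech, noise)
enhanced = apply_mask(mask, noisy)
```

Bins dominated by speech get a mask value near 1 and are retained, while bins dominated by noise get a value near 0 and are suppressed.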


Pages: 2427-2431

Number of pages: 5

