Speech Enhancement Based on Fusion of Both Magnitude/Phase-Aware Features and Targets

被引:3
|
作者
Lang, Haitao [1 ]
Yang, Jie [1 ]
机构
[1] Beijing Univ Chem Technol, Sch Math & Phys, Beijing 100026, Peoples R China
来源
ELECTRONICS | 2020年 / 9卷 / 07期
关键词
speech enhancement; acoustic feature; phase estimation; deep neural networks (DNNs); ACOUSTIC NOISE; ALGORITHM; SEPARATION; RATIO; INTELLIGIBILITY; MASK;
D O I
10.3390/electronics9071125
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, supervised learning methods have shown promising performance, especially deep neural network-based (DNN) methods, in the application of single-channel speech enhancement. Generally, those approaches extract the acoustic features directly from the noisy speech to train a magnitude-aware target. In this paper, we propose to extract the acoustic features not only from the noisy speech but also from the pre-estimated speech, noise and phase separately, then fuse them into a new complementary feature for the purpose of obtaining more discriminative acoustic representation. In addition, on the basis of learning a magnitude-aware target, we also utilize the fusion feature to learn a phase-aware target, thereby further improving the accuracy of the recovered speech. We conduct extensive experiments, including performance comparison with some typical existing methods, generalization ability evaluation on unseen noise, ablation study, and subjective test by human listener, to demonstrate the feasibility and effectiveness of the proposed method. Experimental results prove that the proposed method has the ability to improve the quality and intelligibility of the reconstructed speech.
引用
收藏
页码:1 / 19
页数:18
相关论文
共 50 条
  • [1] Phase-Aware Speech Enhancement Based on Deep Neural Networks
    Zheng, Naijun
    Zhang, Xiao-Lei
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 63 - 76
  • [2] Investigation on the Band Importance of Phase-aware Speech Enhancement
    Zhang, Zhuohuang
    Williamson, Donald S.
    Shen, Yi
    [J]. INTERSPEECH 2022, 2022, : 4651 - 4655
  • [3] Phase-Aware Single-channel Speech Enhancement
    Mowlaee, Pejman
    Watanabe, Mario Kaoru
    Saeidi, Rahim
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1871 - 1873
  • [4] Phase-Aware Speech Enhancement With Complex Wiener Filter
    Nguyen, Huy
    Ho, Tuan Vu
    Akagi, Masato
    Unoki, Masashi
    [J]. IEEE ACCESS, 2023, 11 : 141573 - 141584
  • [5] On Speech Intelligibility Estimation of Phase-Aware Single-Channel Speech Enhancement
    Gaich, Andreas
    Mowlaee, Pejman
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2553 - 2557
  • [6] PACDNN: A phase-aware composite deep neural network for speech enhancement
    Hasannezhad, Mojtaba
    Yu, Hongjiang
    Zhu, Wei-Ping
    Champagne, Benoit
    [J]. SPEECH COMMUNICATION, 2022, 136 : 1 - 13
  • [7] Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement
    Tuan Vu Ho
    Quoc Huy Nguyen
    Akagi, Masato
    Unoki, Masashi
    [J]. INTERSPEECH 2022, 2022, : 176 - 180
  • [8] A Study on the Benefits of Phase-Aware Speech Enhancement in Challenging Noise Scenarios
    Krawczyk-Becker, Martin
    Gerkmann, Timo
    [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 407 - 416
  • [9] ON SPEECH QUALITY ES TIMATION OF PHASE-AWARE SINGLE-CHANNEL SPEECH ENHANCEMENT
    Gaich, Andreas
    Mowlaee, Pejman
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 216 - 220
  • [10] Funnel Deep Complex U-net for Phase-Aware Speech Enhancement
    Sun, Yuhang
    Yang, Linju
    Zhu, Huifeng
    Hao, Jie
    [J]. INTERSPEECH 2021, 2021, : 161 - 165