Investigation on the Band Importance of Phase-aware Speech Enhancement

被引:1
|
作者
Zhang, Zhuohuang [1 ,2 ]
Williamson, Donald S. [1 ]
Shen, Yi [3 ]
机构
[1] Indiana Univ, Dept Comp Sci, Bloomington, IN 47405 USA
[2] Indiana Univ, Dept Speech Language & Hearing Sci, Bloomington, IN 47405 USA
[3] Univ Washington, Dept Speech & Hearing Sci, Seattle, WA 98195 USA
来源
关键词
speech enhancement; phase; speech perception; INTELLIGIBILITY; SENTENCES; PITCH; NETWORK; WORDS; NOISE;
D O I
10.21437/Interspeech.2022-284
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many existing phase-aware speech enhancement algorithms consider the phase at all spectral frequencies to be equally important to perceptual quality and intelligibility. Although improvements are observed according to both objective and subjective measures, as compared to phase-insensitive approaches, it is not clear whether phase information is equally important across the frequency spectrum. In this paper, we investigate the importance of estimating phase across spectral regions, by conducting a pairwise listening study to determine if phase enhancement can be limited to certain frequency bands. Our experimental results suggest that estimating phase at lower-frequency bands is mostly important for speech quality in normal-hearing (NH) listeners. We further propose a hybrid deep-learning framework that adopts two sub-networks for handling phase differently across the spectrum. The proposed hybrid-net significantly improves the model compatibility with low-resource platforms while achieving superior performance to the original phase-aware speech enhancement approaches.
引用
收藏
页码:4651 / 4655
页数:5
相关论文
共 50 条
  • [1] Phase-Aware Single-channel Speech Enhancement
    Mowlaee, Pejman
    Watanabe, Mario Kaoru
    Saeidi, Rahim
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1871 - 1873
  • [2] Phase-Aware Speech Enhancement With Complex Wiener Filter
    Nguyen, Huy
    Ho, Tuan Vu
    Akagi, Masato
    Unoki, Masashi
    [J]. IEEE ACCESS, 2023, 11 : 141573 - 141584
  • [3] Phase-Aware Speech Enhancement Based on Deep Neural Networks
    Zheng, Naijun
    Zhang, Xiao-Lei
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 63 - 76
  • [4] On Speech Intelligibility Estimation of Phase-Aware Single-Channel Speech Enhancement
    Gaich, Andreas
    Mowlaee, Pejman
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2553 - 2557
  • [5] PACDNN: A phase-aware composite deep neural network for speech enhancement
    Hasannezhad, Mojtaba
    Yu, Hongjiang
    Zhu, Wei-Ping
    Champagne, Benoit
    [J]. SPEECH COMMUNICATION, 2022, 136 : 1 - 13
  • [6] Vector-quantized Variational Autoencoder for Phase-aware Speech Enhancement
    Tuan Vu Ho
    Quoc Huy Nguyen
    Akagi, Masato
    Unoki, Masashi
    [J]. INTERSPEECH 2022, 2022, : 176 - 180
  • [7] A Study on the Benefits of Phase-Aware Speech Enhancement in Challenging Noise Scenarios
    Krawczyk-Becker, Martin
    Gerkmann, Timo
    [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 407 - 416
  • [8] ON SPEECH QUALITY ES TIMATION OF PHASE-AWARE SINGLE-CHANNEL SPEECH ENHANCEMENT
    Gaich, Andreas
    Mowlaee, Pejman
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 216 - 220
  • [9] Funnel Deep Complex U-net for Phase-Aware Speech Enhancement
    Sun, Yuhang
    Yang, Linju
    Zhu, Huifeng
    Hao, Jie
    [J]. INTERSPEECH 2021, 2021, : 161 - 165
  • [10] Speech Enhancement Based on Fusion of Both Magnitude/Phase-Aware Features and Targets
    Lang, Haitao
    Yang, Jie
    [J]. ELECTRONICS, 2020, 9 (07): : 1 - 19