Phase Estimation in Single-Channel Speech Enhancement: Limits-Potential

被引:42
|
作者
Mowlaee, Pejman [1 ]
Kulmer, Josef [1 ]
机构
[1] Graz Univ Technol, Dept Elect Engn, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
基金
奥地利科学基金会;
关键词
Perceived quality; phase estimation; signal reconstruction; speech enhancement; speech intelligibility; COEFFICIENTS;
D O I
10.1109/TASLP.2015.2430820
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present an overview on the previous and recent methods proposed to estimate a clean spectral phase from a noisy observation in the context of single-channel speech enhancement. The importance of phase estimation in speech enhancement is inspired by the recent reports on its usefulness in finding a phase-sensitive amplitude estimation. We present a comparative study of the recent phase estimation methods and elaborate their limits. We propose a new phase enhancement method relying on phase decomposition and time-frequency smoothing filters. We demonstrate that the proposed time-frequency phase smoothing method successfully reduces the variance of the noisy phase at harmonics. Our results on different speech and noise databases and different signal-to-noise ratios show that in contrast to the existing benchmark methods only the proposed method balances a tradeoff between a joint improvement in perceived quality of 0.2 in PESQ score and speech intelligibility of 2% by phase-only enhancement.
引用
收藏
页码:1283 / 1294
页数:12
相关论文
共 50 条
  • [21] FPGA Implementation of a Phase-Aware Single-Channel Speech Enhancement System
    Samui, Suman
    Sahu, Pragya
    Chakrabarti, Indrajit
    Ghosh, Soumya K.
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (11) : 4688 - 4715
  • [22] FPGA Implementation of a Phase-Aware Single-Channel Speech Enhancement System
    Suman Samui
    Pragya Sahu
    Indrajit Chakrabarti
    Soumya K. Ghosh
    [J]. Circuits, Systems, and Signal Processing, 2017, 36 : 4688 - 4715
  • [23] An evaluation of the perceptual quality of phase-aware single-channel speech enhancement
    Krawczyk-Becker, Martin
    Gerkmann, Timo
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (04): : EL364 - EL369
  • [24] Single-channel speech enhancement using inter-component phase relations
    Barysenka, Siarhei Y.
    Vorobiov, Vasili, I
    Mowlaee, Pejman
    [J]. SPEECH COMMUNICATION, 2018, 99 : 144 - 160
  • [25] Single-channel speech enhancement by subspace affinity minimization
    Tran, Dung N.
    Koishida, Kazuhito
    [J]. INTERSPEECH 2020, 2020, : 2447 - 2451
  • [26] Single-Channel Speech Enhancement Based on Psychoacoustic Masking
    Zhou, Tingting
    Zeng, Yumin
    Wang, Rongrong
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (04): : 272 - 284
  • [27] Single-channel speech enhancement using colored spectrograms
    Gul, Sania
    Khan, Muhammad Salman
    Fazeel, Muhammad
    [J]. COMPUTER SPEECH AND LANGUAGE, 2024, 86
  • [28] CompNet: Complementary network for single-channel speech enhancement
    Fan, Cunhang
    Zhang, Hongmei
    Li, Andong
    Xiang, Wang
    Zheng, Chengshi
    Lv, Zhao
    Wu, Xiaopei
    [J]. NEURAL NETWORKS, 2023, 168 : 508 - 517
  • [29] Comparative Studies of Single-Channel Speech Enhancement Techniques
    Kumar, Bittu
    Kumar, Neeraj
    Kumar, Manoj
    Prasad, S. V. S.
    Varma, Ashwini Kumar
    Ravi, Banoth
    [J]. IETE JOURNAL OF RESEARCH, 2024, 70 (06) : 5704 - 5720
  • [30] Single-Channel Speech Enhancement Using Double Spectrum
    Blass, Martin
    Mowlaee, Pejman
    Kleijn, W. Bastiaan
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1740 - 1744