MMSE-Optimal Spectral Amplitude Estimation Given the STFT-Phase

Cited by: 47
Authors
Gerkmann, Timo [1]
Krawczyk, Martin [1]
Affiliations
[1] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, Speech Signal Proc Grp, D-26111 Oldenburg, Germany
Keywords
Noise reduction; phase estimation; signal reconstruction; speech enhancement; SPEECH ENHANCEMENT; MAGNITUDE ESTIMATION;
DOI
10.1109/LSP.2012.2233470
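Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Subject classification code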
0808 ; 0809 ;
Abstract
In this letter, we derive a minimum mean squared error (MMSE) optimal estimator for clean speech spectral amplitudes, which we apply in single channel speech enhancement. As opposed to state-of-the-art estimators, the optimal estimator is derived for a given clean speech spectral phase. We show that the phase contains additional information that can be exploited to distinguish outliers in the noise from the target signal. With the proposed technique, incorporating the phase can potentially improve the PESQ-MOS by 0.5 in babble noise as compared to state-of-the-art amplitude estimators. In a blind setup we achieve a PESQ improvement of around 0.25 in voiced speech.
Pages: 129 - 132
Number of pages: 4
Related papers
17 in total
  • [1] β-order MMSE spectral amplitude estimation for speech enhancement
    You, CH
    Koh, SN
    Rahardja, S
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (04) : 475 - 486
  • [2] MMSE-optimal approximation of continuous-phase modulated signal as superposition of linearly modulated pulses
    Huang, XJ
    Li, YX
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2005, 53 (07) : 1166 - 1177
  • [3] MMSE-OPTIMAL ENHANCEMENT OF COMPLEX SPEECH COEFFICIENTS WITH UNCERTAIN PRIOR KNOWLEDGE OF THE CLEAN SPEECH PHASE
    Gerkmann, Timo
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [4] On MMSE-Based Estimation of Amplitude and Complex Speech Spectral Coefficients Under Phase-Uncertainty
    Krawczyk-Becker, Martin
    Gerkmann, Timo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2251 - 2262
  • [5] Speech enhancement using MMSE estimation of amplitude and complex speech spectral coefficients under phase-uncertainty
    Kandagatla, Ravi Kumar
    Subbaiah, P. V.
    [J]. SPEECH COMMUNICATION, 2018, 96 : 10 - 27
  • [6] MMSE SPEECH SPECTRAL AMPLITUDE ESTIMATION ASSUMING NON-GAUSSIAN NOISE
    Fodor, Balazs
    Fingscheidt, Tim
    [J]. 19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2314 - 2318
  • [7] Speech enhancement based on β-order MMSE estimation of Short Time Spectral Amplitude and Laplacian speech modeling
    Abutalebi, Hamid Reza
    Rashidinejad, Mehdi
    [J]. SPEECH COMMUNICATION, 2015, 67 : 92 - 101
  • [8] On the probability of resolution for the Amplitude and Phase EStimation (APES) spectral estimator
    Richmond, CD
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1025 - 1028
  • [9] Optimal estimation of non-stationary phase and amplitude processes
    Andrieu, C
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 637 - 640
  • [10] On DCT-based MMSE estimation of short time spectral amplitude for single-channel speech enhancement
    Shi, Sisi
    Paliwal, Kuldip
    Busch, Andrew
    [J]. APPLIED ACOUSTICS, 2023, 202