On MMSE-Based Estimation of Amplitude and Complex Speech Spectral Coefficients Under Phase-Uncertainty

被引:26
|
作者
Krawczyk-Becker, Martin [1 ]
Gerkmann, Timo [1 ]
机构
[1] Univ Hamburg, Dept Informat, Signal Proc Grp, D-22527 Hamburg, Germany
关键词
Noise reduction; signal reconstruction; speech enhancement; SQUARE ERROR ESTIMATION; A-PRIORI KNOWLEDGE; BAYESIAN-ESTIMATION; ENHANCEMENT; NOISE; REAL; SNR;
D O I
10.1109/TASLP.2016.2602549
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Among the most commonly used single-channel approaches for the enhancement of noise corrupted speech are Bayesian estimators of clean speech coefficients in the short-time Fourier transform domain. However, the vast majority of these approaches effectively only modifies the spectral amplitude and does not consider any information about the clean speech spectral phase. More recently, clean speech estimators that can utilize prior phase information have been proposed and shown to lead to improvements over the traditional, phase-blind approaches. In this work, we revisit phase-aware estimators of clean speech amplitudes and complex coefficients. To complete the existing set of estimators, we first derive a novel amplitude estimator given uncertain prior phase information. Second, we derive a closed-form solution for complex coefficients when the prior phase information is completely uncertain or not available. We put the novel estimators into the context of existing estimators and discuss their advantages and disadvantages.
引用
收藏
页码:2251 / 2262
页数:12
相关论文
共 34 条
  • [1] Speech enhancement using MMSE estimation of amplitude and complex speech spectral coefficients under phase-uncertainty
    Kandagatla, Ravi Kumar
    Subbaiah, P. V.
    [J]. SPEECH COMMUNICATION, 2018, 96 : 10 - 27
  • [2] Speech enhancement using MMSE estimation under phase uncertainty
    Kandagatla R.
    Subbaiah P.V.
    [J]. International Journal of Speech Technology, 2017, 20 (2) : 373 - 385
  • [3] MMSE Log-Spectral Amplitude Estimation for Single Channel Speech Enhancement under Speech Presence Uncertainty by Weibull Speech Priors
    Bahrami, Mojtaba
    Seyedin, Sanaz
    [J]. 26TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2018), 2018, : 749 - 754
  • [4] β-order MMSE spectral amplitude estimation for speech enhancement
    You, CH
    Koh, SN
    Rahardja, S
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (04): : 475 - 486
  • [5] MMSE Estimation of Speech Power Spectral Density Under Speech Presence Uncertainty for Automatic Speech Recognition
    Liu, Jingang
    Zhou, Yi
    Ma, Yongbao
    Liu, Hongqing
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2016, : 412 - 416
  • [6] Speech enhancement based on β-order MMSE estimation of Short Time Spectral Amplitude and Laplacian speech modeling
    Abutalebi, Hamid Reza
    Rashidinejad, Mehdi
    [J]. SPEECH COMMUNICATION, 2015, 67 : 92 - 101
  • [7] MMSE SPEECH SPECTRAL AMPLITUDE ESTIMATION ASSUMING NON-GAUSSIAN NOISE
    Fodor, Balazs
    Fingscheidt, Tim
    [J]. 19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2314 - 2318
  • [8] Microphone Array Based Speech Spectral Amplitude Estimators with Phase Estimation
    Parchami, Mahdi
    Zhu, Wei-Ping
    Champagne, Benoit
    [J]. 2014 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2014, : 133 - 136
  • [9] DeepMMSE: A Deep Learning Approach to MMSE-Based Noise Power Spectral Density Estimation
    Zhang, Qiquan
    Nicolson, Aaron
    Wang, Mingjiang
    Paliwal, Kuldip
    Wang, Chenxu
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1404 - 1415
  • [10] Use of speech presence uncertainty with MMSE spectral energy estimation for robust automatic speech recognition
    Stark, Anthony
    Paliwal, Kuldip
    [J]. SPEECH COMMUNICATION, 2011, 53 (01) : 51 - 61