A Risk-Estimation-Based Comparison of Mean Square Error and Itakura-Saito Distortion Measures for Speech Enhancement

被引:0
|
作者
Muraka, Nagarjuna Reddy [1 ]
Seelamantula, Chandra Sekhar [1 ]
机构
[1] Indian Inst Sci, Dept Elect Engn, Bangalore 560012, Karnataka, India
关键词
Stein's unbiased risk estimator; Mean square error; Itakura-Saito distortion;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of speech enhancement algorithms is to provide an estimate of clean speech starting from noisy observations. In general, the estimate is obtained by minimizing a chosen distortion metric. The often-employed cost is the mean-square error (MSE), which results in a Wiener-filter solution. Since the ground truth is not available in practice, the practical utility of the optimal estimators is limited. Alternative, one can optimize an unbiased estimate of the MSE. This is the key idea behind Stein's unbiased risk estimation (SURE) principle. Within this framework, we derive SURE solutions for the MSE and Itakura-Saito (IS) distortion measures. We also propose parametric versions of the corresponding SURE estimators, which give additional flexibility in controlling the attenuation characteristics for maximum signal-to-noise-ratio (SNR) gain. We compare the performance of the two distortion measures in terms of attenuation profiles, average segmental SNR, global SNR, and spectrograms. We also include a comparison with the standard power spectral subtraction technique. The results show that the IS distortion consistently gives better performance gain in all respects. The perceived quality of the enhanced speech is also better in case of the IS metric.
引用
收藏
页码:356 / 359
页数:4
相关论文
共 16 条
  • [1] Speech Enhancement Based on Discrete Wavelet Packet Transform and Itakura-Saito Nonnegative Matrix Factorisation
    Liu, Houguang
    Wang, Wenbo
    Xue, Lin
    Yang, Jianhua
    Wang, Zhihua
    Hua, Chunli
    [J]. ARCHIVES OF ACOUSTICS, 2020, 45 (04) : 565 - 572
  • [2] Speech enhancement based on minimum mean-square error estimation and supergaussian priors
    Martin, R
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 845 - 856
  • [3] Noise Estimation Using Mean Square Cross Prediction Error for Speech Enhancement
    Wang, Gang
    Li, Chunguang
    Dong, Le
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2010, 57 (07) : 1489 - 1499
  • [4] Semi-parametric joint detection and estimation for speech enhancement based on minimum mean square error
    Van-Khanh Mai
    Pastor, Dominique
    Aissa-El-Bey, Abdeldjalil
    Le Bidan, Raphael
    [J]. SPEECH COMMUNICATION, 2018, 102 : 27 - 38
  • [5] Speech Enhancement By Minimum Mean-Square Error Spectral Amplitude Estimation Assuming Weibull Speech Priors
    Bahrami, Mojtaba
    Faraji, Neda
    [J]. 2017 19TH CSI INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2017, : 190 - 194
  • [6] Signal Enhancement Algorithm Based on Minimum Mean Square Error Spectrum Estimation
    Lv Yunfei
    Liu Bing
    Yang Yue
    Mei Jidan
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2018,
  • [7] Speech enhancement based on minimum mean-square error short-time spectral estimation and its realization
    Liu, ZB
    Xu, NP
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT PROCESSING SYSTEMS, VOLS 1 & 2, 1997, : 1794 - 1797
  • [8] Monaural Speech Separation Based on Gain Adapted Minimum Mean Square Error Estimation
    Radfar, M. H.
    Dansereau, R. M.
    Chan, W-Y.
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2010, 61 (01): : 21 - 37
  • [9] Monaural Speech Separation Based on Gain Adapted Minimum Mean Square Error Estimation
    M. H. Radfar
    R. M. Dansereau
    W.-Y. Chan
    [J]. Journal of Signal Processing Systems, 2010, 61 : 21 - 37
  • [10] Beta-order minimum mean-square error multichannel spectral amplitude estimation for speech enhancement
    Trawicki, M. B.
    Johnson, M. T.
    [J]. INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2015, 29 (10) : 1287 - 1295