Speech enhancement based on Bayesian decision and spectral amplitude estimation

被引:0
|
作者
Feng Deng
Chang-Chun Bao
机构
[1] Beijing University of Technology,Speech and Audio Signal Processing Lab, School of Electronic Information and Control Engineering
关键词
Speech enhancement; Bayesian decision; Spectral amplitude estimation; Combined Bayesian risk function; General weighted cost function;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, a single-channel speech enhancement method based on Bayesian decision and spectral amplitude estimation is proposed, in which the speech detection module and spectral amplitude estimation module are included, and the two modules are strongly coupled. First, under the decisions of speech presence and speech absence, the optimal speech amplitude estimators are obtained by minimizing a combined Bayesian risk function, respectively. Second, using the obtained spectral amplitude estimators, the optimal speech detector is achieved by further minimizing the combined Bayesian risk function. Finally, according to the detection results of speech detector, the optimal decision rule is made and the optimal spectral amplitude estimator is chosen for enhancing noisy speech. Furthermore, by considering both detection and estimation errors, we propose a combined cost function which incorporates two general weighted distortion measures for the speech presence and speech absence of the spectral amplitudes, respectively. The cost parameters in the cost function are employed to balance the speech distortion and residual noise caused by missed detection and false alarm, respectively. In addition, we propose two adaptive calculation methods for the perceptual weighted order p and the spectral amplitude order β concerned in the proposed cost function, respectively. The objective and subjective test results indicate that the proposed method can achieve a more significant segmental signal-noise ratio (SNR) improvement, a lower log-spectral distortion, and a better speech quality than the reference methods.
引用
收藏
相关论文
共 50 条
  • [1] Speech enhancement based on Bayesian decision and spectral amplitude estimation
    Deng, Feng
    Bao, Chang-Chun
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015,
  • [2] BAYESIAN SPECTRAL AMPLITUDE ESTIMATION FOR SPEECH ENHANCEMENT WITH CORRELATED SPECTRAL COMPONENTS
    Plourde, Eric
    Champagne, Benoit
    [J]. 2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 397 - 400
  • [3] Multichannel speech enhancement using Bayesian spectral amplitude estimation
    Lotter, T
    Benien, C
    Vary, P
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 880 - 883
  • [4] Microphone array speech enhancement by Bayesian estimation of spectral amplitude and phase
    Balan, R
    Rosca, J
    [J]. SAM2002: IEEE SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP PROCEEDINGS, 2002, : 209 - 213
  • [5] Generalized Bayesian Estimators of the Spectral Amplitude for Speech Enhancement
    Plourde, Eric
    Champagne, Benoit
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (06) : 485 - 488
  • [6] β-order MMSE spectral amplitude estimation for speech enhancement
    You, CH
    Koh, SN
    Rahardja, S
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (04): : 475 - 486
  • [7] A Speech Enhancement Method by Coupling Speech Detection and Spectral Amplitude Estimation
    Deng, Feng
    Bao, Chang-Chun
    Bao, Feng
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3233 - 3237
  • [8] Speech enhancement based on a microphone array and log-spectral amplitude estimation
    Cohen, I
    Berdugo, B
    [J]. 22ND CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, PROCEEDINGS, 2002, : 4 - 6
  • [9] Generalized maximum a posteriori spectral amplitude estimation for speech enhancement
    Tsao, Yu
    Lai, Ying-Hui
    [J]. SPEECH COMMUNICATION, 2016, 76 : 112 - 126
  • [10] Distributed multichannel speech enhancement based on perceptually-motivated Bayesian estimators of the spectral amplitude
    Trawicki, Marek B.
    Johnson, Michael T.
    [J]. IET SIGNAL PROCESSING, 2013, 7 (04) : 337 - 344