Generalized maximum a posteriori spectral amplitude estimation for speech enhancement

Cited by: 34
Authors
Tsao, Yu [1]
Lai, Ying-Hui [1]
Affiliations
[1] Academia Sinica, Research Center for Information Technology Innovation (CITI), 128 Academia Road, Section 2, Taipei 11529, Taiwan
Keywords
Speech enhancement; Spectral restoration; Generalized MAPA; GMAPA; Noise reduction; Online estimation; Suppression; Probability; Compression; Filter; Power
DOI
10.1016/j.specom.2015.10.003
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
Spectral restoration methods for speech enhancement aim to remove noise components from noisy speech signals by applying a gain function in the spectral domain. The design of the gain function is one of the most important factors in obtaining enhanced speech of good quality. In most studies, the gain function is designed by optimizing a criterion based on assumptions about the noise and speech distributions, such as the minimum mean square error (MMSE), maximum likelihood (ML), and maximum a posteriori (MAP) criteria. The MAP criterion has the advantage of yielding a more reliable gain function by incorporating a suitable prior density. However, as several studies have shown, it has a drawback: although a MAP-based estimator effectively reduces noise components when the signal-to-noise ratio (SNR) is low, it introduces large speech distortion when the SNR is high. To solve this problem, we have proposed a generalized maximum a posteriori spectral amplitude (GMAPA) algorithm for designing the gain function for speech enhancement. The GMAPA algorithm dynamically specifies the weight of the prior density of speech spectra according to the SNR of the test speech signals in order to calculate the optimal gain function. When the SNR is high, GMAPA adopts a small weight to prevent overcompensation that may result in speech distortion. When the SNR is low, GMAPA uses a large weight to avoid disturbance of the restoration caused by measurement noise. Our previous study showed that the weight of the prior density plays a crucial role in GMAPA performance; in that study, the weight was determined from the SNR at the utterance level. In this paper, we propose computing the weight with consideration of time-frequency correlations, which results in a more accurate estimate of the gain function. Experiments were carried out to evaluate the proposed algorithm in both objective and subjective tests. The results of the objective tests indicate that GMAPA is promising compared to several well-known algorithms at both high and low SNRs. The results of the subjective listening tests indicate that GMAPA provides significantly higher sound quality than other speech enhancement algorithms. (C) 2015 Elsevier B.V. All rights reserved.
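The abstract describes the mechanism only in words, so the following is a minimal sketch of the idea and not the authors' implementation. It assumes a Rayleigh speech prior raised to a power alpha (the prior weight) and the usual high-SNR approximation of the amplitude likelihood; under those assumptions the MAP amplitude gain has the closed form used in gmapa_gain, which reduces to the MAP amplitude gain at alpha = 1 and to the ML gain at alpha = 0. The function names, the logistic SNR-to-weight mapping in prior_weight, and all of its parameters (alpha_max, midpoint_db, slope) are hypothetical and chosen only to illustrate "small weight at high SNR, large weight at low SNR". The time-frequency correlation refinement proposed in the paper is not modeled here.

```python
import math


def gmapa_gain(xi, gamma, alpha):
    """Illustrative gain for one time-frequency bin (assumed closed form).

    xi    : a priori SNR (linear scale, > 0)
    gamma : a posteriori SNR (linear scale, > 0)
    alpha : weight of the speech prior (>= 0); 1 gives the MAP amplitude
            gain, 0 gives the ML gain under the stated assumptions
    """
    if xi <= 0 or gamma <= 0 or alpha < 0:
        raise ValueError("xi and gamma must be positive, alpha non-negative")
    disc = xi * xi + (2.0 * alpha - 1.0) * (alpha + xi) * xi / gamma
    # For alpha < 0.5 and gamma < 1 the discriminant can go negative;
    # clamping to zero is a choice made for this sketch only.
    disc = max(disc, 0.0)
    return (xi + math.sqrt(disc)) / (2.0 * (alpha + xi))


def prior_weight(snr_db, alpha_max=1.0, midpoint_db=10.0, slope=0.5):
    """Hypothetical decreasing logistic map from SNR (dB) to prior weight.

    High SNR gives a weight near 0, low SNR a weight near alpha_max.
    """
    return alpha_max / (1.0 + math.exp(slope * (snr_db - midpoint_db)))


if __name__ == "__main__":
    xi, gamma = 1.0, 2.0
    for snr_db in (-10.0, 10.0, 30.0):
        alpha = prior_weight(snr_db)
        gain = gmapa_gain(xi, gamma, alpha)
        print(f"SNR {snr_db:6.1f} dB  alpha {alpha:.4f}  gain {gain:.4f}")
```

With xi and gamma held fixed, a smaller weight yields a larger gain (less attenuation), which matches the stated goal of limiting speech distortion at high SNR, while a larger weight attenuates more strongly at low SNR.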
Pages: 112 - 126
Number of pages: 15
Related papers
50 in total
  • [1] SPEECH ENHANCEMENT USING GENERALIZED MAXIMUM A POSTERIORI SPECTRAL AMPLITUDE ESTIMATOR
    Su, Yu-Cheng
    Tsao, Yu
    Wu, Jung-En
    Jean, Fu-Rong
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7467 - 7471
  • [2] EVALUATION OF GENERALIZED MAXIMUM A POSTERIORI SPECTRAL AMPLITUDE (GMAPA) SPEECH ENHANCEMENT ALGORITHM IN HEARING AIDS
    Lai, Ying-Hui
    Su, Yu-Cheng
    Tsao, Yu
    Young, Shuenn-Tsong
    [J]. 2013 IEEE 17TH INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE), 2013, : 245 - +
  • [3] LOG-SPECTRAL AMPLITUDE ESTIMATION WITH GENERALIZED GAMMA DISTRIBUTIONS FOR SPEECH ENHANCEMENT
    Borgstroem, Bengt J.
    Alwan, Abeer
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4756 - 4759
  • [4] Multi-band Spectral Subtraction of Speech Enhancement Based on Maximum Posteriori Phase Estimation
    Li, Zhen
    Wu, Wenjin
    Zhang, Qin
    Ren, Hui
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2017, 39 (09) : 2282 - 2286
  • [5] Generalization of Maximum A Posteriori Amplitude Estimator Under Speech Presence Uncertainty for Speech Enhancement
    Momeni, Hajar
    Abutalebi, Hamid Reza
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2014, 33 (08) : 2565 - 2582
  • [6] Generalization of Maximum A Posteriori Amplitude Estimator Under Speech Presence Uncertainty for Speech Enhancement
    Hajar Momeni
    Hamid Reza Abutalebi
    [J]. Circuits, Systems, and Signal Processing, 2014, 33 : 2565 - 2582
  • [7] Generalized Bayesian Estimators of the Spectral Amplitude for Speech Enhancement
    Plourde, Eric
    Champagne, Benoit
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (06) : 485 - 488
  • [8] β-order MMSE spectral amplitude estimation for speech enhancement
    You, CH
    Koh, SN
    Rahardja, S
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (04) : 475 - 486
  • [9] A Speech Enhancement Method by Coupling Speech Detection and Spectral Amplitude Estimation
    Deng, Feng
    Bao, Chang-Chun
    Bao, Feng
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3233 - 3237
  • [10] BAYESIAN SPECTRAL AMPLITUDE ESTIMATION FOR SPEECH ENHANCEMENT WITH CORRELATED SPECTRAL COMPONENTS
    Plourde, Eric
    Champagne, Benoit
    [J]. 2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 397 - 400