LOG-SPECTRAL AMPLITUDE ESTIMATION WITH GENERALIZED GAMMA DISTRIBUTIONS FOR SPEECH ENHANCEMENT

被引:0
|
作者
Borgstroem, Bengt J. [1 ]
Alwan, Abeer [1 ]
机构
[1] Univ Calif Los Angeles, Dept Elect Engn, Los Angeles, CA 90024 USA
关键词
Speech Enhancement; Log-spectral Amplitude Estimator; Generalized Gamma Distribution; SQUARE ERROR ESTIMATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a family of log-spectral amplitude (LSA) estimators for speech enhancement. Generalized Gamma distributed (GGD) priors are assumed for speech short-time spectral amplitudes (STSAs), providing mathematical flexibility in capturing the statistical behavior of speech. Although solutions are not obtainable in closed-form, estimators are expressed as limits, and can be efficiently approximated. When applied to the Noizeus database [1], proposed estimators are shown to provide improvements in segmental signal-to-noise ratio (SSNR) and COSH distance [2], relative to the LSA estimator proposed by Ephraim and Malah [3].
引用
收藏
页码:4756 / 4759
页数:4
相关论文
共 50 条
  • [31] SPEECH ENHANCEMENT USING GENERALIZED MAXIMUM A POSTERIORI SPECTRAL AMPLITUDE ESTIMATOR
    Su, Yu-Cheng
    Tsao, Yu
    Wu, Jung-En
    Jean, Fu-Rong
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7467 - 7471
  • [32] NOISE ESTIMATION USING A CONSTRAINED SEQUENTIAL HMM IN LOG-SPECTRAL DOMAIN
    Ying, Dongwen
    Lu, Xugang
    Li, Junfeng
    Yan, Yonghong
    Dang, Jianwu
    Soong, Frank
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4553 - 4556
  • [33] Robust Speech Recognition Using MLP Neural Network in Log-Spectral Domain
    Ghaemmaghami, Masoumeh P.
    Sameti, Hossein
    Razzazi, Farbod
    BabaAli, Bagher
    Dabbaghchian, Saeed
    2009 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2009), 2009, : 467 - +
  • [34] Microphone array speech enhancement by Bayesian estimation of spectral amplitude and phase
    Balan, R
    Rosca, J
    SAM2002: IEEE SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP PROCEEDINGS, 2002, : 209 - 213
  • [35] Log-spectral feature reconstruction based on an occlusion model for noise robust speech recognition
    Gonzalez, Jose A.
    Peinado, Antonio M.
    Gomez, Angel M.
    Ma, Ning
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2629 - 2632
  • [36] Bayesian decision theoretic scale-adaptive estimation of a log-spectral density
    Pensky, Marianna
    Vidakovic, Brani
    De Canditiis, Daniela
    STATISTICA SINICA, 2007, 17 (02) : 635 - 666
  • [37] Speech Enhancement by MAP Spectral Amplitude Estimation Using a Super-Gaussian Speech Model
    Thomas Lotter
    Peter Vary
    EURASIP Journal on Advances in Signal Processing, 2005
  • [38] Speech enhancement by MAP spectral amplitude estimation using a super-Gaussian speech model
    Lotter, T
    Vary, P
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (07) : 1110 - 1126
  • [39] Multichannel direction-independent speech enhancement using spectral amplitude estimation
    Lotter, T
    Benien, C
    Vary, P
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (11) : 1147 - 1156
  • [40] Multichannel Direction-Independent Speech Enhancement Using Spectral Amplitude Estimation
    Thomas Lotter
    Christian Benien
    Peter Vary
    EURASIP Journal on Advances in Signal Processing, 2003