Maximum a Posteriori Speech Enhancement Based on Double Spectrum

Cited by: 0
Authors
Mowlaee, Pejman [1, 2]
Scheran, Daniel [2]
Stahl, Johannes [2]
Wood, Sean U. N. [2]
Kleijn, W. Bastiaan [3]
Affiliations
[1] Widex AS, Nymollevej 6, DK-3540 Lynge, Denmark
[2] Graz Univ Technol, Signal Proc & Speech Commun Lab, Graz, Austria
[3] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand
Source
Funding
Austrian Science Fund
Keywords
Speech Enhancement; Modulation Domain Processing; Double Spectrum; MAP Estimator; Quality Assessment; Intelligibility
DOI
10.21437/Interspeech.2019-1197
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
While the acoustic frequency domain has been widely used for speech enhancement, usage of the modulation domain is less common. In this paper, we investigate single-channel speech enhancement in the recently proposed Double Spectrum (DS) framework and provide insights on the statistical properties of speech and noise in the DS domain. Relying on our statistical analysis in the DS, we derive a maximum a posteriori estimator of speech in the DS domain. By means of experiments, we evaluate the speech enhancement performance of the proposed method and relevant benchmarks in the acoustic frequency and modulation domains and show that the proposed method achieves a good balance between noise attenuation and speech distortion for various SNRs and noise types.
Pages: 2738-2742
Page count: 5