Maximum a Posteriori Speech Enhancement Based on Double Spectrum

Cited by: 0
Authors
Mowlaee, Pejman [1, 2]
Scheran, Daniel [2]
Stahl, Johannes [2]
Wood, Sean U. N. [2]
Kleijn, W. Bastiaan [3]
Affiliations
[1] Widex AS, Nymollevej 6, DK-3540 Lynge, Denmark
[2] Graz Univ Technol, Signal Proc & Speech Commun Lab, Graz, Austria
[3] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand
Source
Funding
Austrian Science Fund
Keywords
Speech Enhancement; Modulation Domain Processing; Double Spectrum; MAP Estimator; Quality Assessment; Intelligibility
DOI
10.21437/Interspeech.2019-1197
Chinese Library Classification number
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
While the acoustic frequency domain has been widely used for speech enhancement, usage of the modulation domain is less common. In this paper, we investigate single-channel speech enhancement in the recently proposed Double Spectrum (DS) framework and provide insights on the statistical properties of speech and noise in the DS domain. Relying on our statistical analysis in the DS, we derive a maximum a posteriori estimator of speech in the DS domain. By means of experiments, we evaluate the speech enhancement performance of the proposed method and relevant benchmarks in the acoustic frequency and modulation domains and show that the proposed method achieves a good balance between noise attenuation and speech distortion for various SNRs and noise types.
Pages: 2738-2742
Page count: 5
Related papers
50 in total
  • [31] Speech enhancement algorithm based on wavelet packet adaptive threshold revised by posteriori SNR
    Zhang, Xueying
    Ren, Yongmei
    Jia, Hairong
    Zhongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Central South University (Science and Technology), 2013, 44 (11) : 4566 - 4573
  • [32] Anomaly detection based on maximum a posteriori
    Li, Shifeng
    Liu, Chunxiao
    Yang, Yuqiang
    PATTERN RECOGNITION LETTERS, 2018, 107 : 91 - 97
  • [33] Constrained Structural Maximum A Posteriori Linear Regression for Average-Voice-Based Speech Synthesis
    Nakano, Yuji
    Tachibana, Makoto
    Yamagishi, Junichi
    Kobayashi, Takao
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2286 - 2289
  • [34] Maximum a posteriori based adaptive algorithms
    Huang, Dong-Yan
    Rahardja, Susanto
    Huang, Haibin
    CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 1628 - 1632
  • [35] NOISE POWER SPECTRUM ESTIMATION BASED ON WEAK SPEECH PROTECTION FOR SPEECH ENHANCEMENT
    Feng, Yan
    An, Baokun
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 484 - 487
  • [36] Double Adversarial Network based Monaural Speech Enhancement for Robust Speech Recognition
    Du, Zhihao
    Han, Jiqing
    Zhang, Xueliang
    INTERSPEECH 2020, 2020, : 309 - 313
  • [37] Speech enhancement based on second order architecture and maximum entropy algorithm
    Yu, X
    Hu, GR
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 572 - 575
  • [38] Speech enhancement based on adaptive wavelet denoising on multitaper spectrum
    Hsung, Tai-Chiu
    Lun, Daniel Pak-Kong
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 1700 - 1703
  • [39] Speech enhancement based on multitaper spectrum and psychoacoustical weighting rule
    Wu, Hongwei
    Wu, Zhenyang
    Zhao, Li
    Chinese Journal of Acoustics, 2007, (03) : 278 - 288
  • [40] Speech Enhancement Based on Noise-Compensated Phase Spectrum
    Islam, Md. T.
    Shahnaz, C.
    2014 1ST INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION & COMMUNICATION TECHNOLOGY (ICEEICT 2014), 2014,