Maximum a Posteriori Speech Enhancement Based on Double Spectrum

Cited by: 0

Authors:

Mowlaee, Pejman [1, 2]

Scheran, Daniel [2]

Stahl, Johannes [2]

Wood, Sean U. N. [2]

Kleijn, W. Bastiaan [3]

Affiliations:

[1] Widex AS, Nymollevej 6, DK-3540 Lynge, Denmark

[2] Graz University of Technology, Signal Processing and Speech Communication Laboratory, Graz, Austria

[3] Victoria University of Wellington, School of Engineering and Computer Science, Wellington, New Zealand

Source:

INTERSPEECH 2019 | 2019

Funding:

Austrian Science Fund

Keywords:

Speech Enhancement; Modulation Domain Processing; Double Spectrum; MAP Estimator; Quality Assessment; Intelligibility

DOI:

10.21437/Interspeech.2019-1197

Chinese Library Classification (CLC) numbers:

R36 [Pathology]; R76 [Otorhinolaryngology]

Discipline classification codes:

100104; 100213

Abstract:

While the acoustic frequency domain has been widely used for speech enhancement, usage of the modulation domain is less common. In this paper, we investigate single-channel speech enhancement in the recently proposed Double Spectrum (DS) framework and provide insights on the statistical properties of speech and noise in the DS domain. Relying on our statistical analysis in the DS, we derive a maximum a posteriori estimator of speech in the DS domain. By means of experiments, we evaluate the speech enhancement performance of the proposed method and relevant benchmarks in the acoustic frequency and modulation domains and show that the proposed method achieves a good balance between noise attenuation and speech distortion for various SNRs and noise types.


Pages: 2738-2742

Number of pages: 5
