Maximum a Posteriori Speech Enhancement Based on Double Spectrum

被引：0

作者：

Mowlaee, Pejman ^{[1
,2
]}

Scheran, Daniel ^{[2
]}

Stahl, Johannes ^{[2
]}

Wood, Sean U. N. ^{[2
]}

Kleijn, W. Bastiaan ^{[3
]}

机构：

[1] Widex AS, Nymollevej 6, DK-3540 Lynge, Denmark

[2] Graz Univ Technol, Signal Proc & Speech Commun Lab, Graz, Austria

[3] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand

来源：

INTERSPEECH 2019 | 2019年

基金：

奥地利科学基金会;

关键词：

Speech Enhancement; Modulation Domain Processing; Double Spectrum; MAP Estimator; QUALITY ASSESSMENT; INTELLIGIBILITY;

D O I：

10.21437/Interspeech.2019-1197

中图分类号：

R36 [病理学]; R76 [耳鼻咽喉科学];

学科分类号：

100104 ; 100213 ;

摘要：

While the acoustic frequency domain has been widely used for speech enhancement, usage of the modulation domain is less common. In this paper, we investigate single-channel speech enhancement in the recently proposed Double Spectrum (DS) framework and provide insights on the statistical properties of speech and noise in the DS domain. Relying on our statistical analysis in the DS, we derive a maximum a posteriori estimator of speech in the DS domain. By means of experiments, we evaluate the speech enhancement performance of the proposed method and relevant benchmarks in the acoustic frequency and modulation domains and show that the proposed method achieves a good balance between noise attenuation and speech distortion for various SNRs and noise types.

引用

下载

页码：2738 / 2742

页数：5

共 50 条

[31] Speech enhancement algorithm based on wavelet packet adaptive threshold revised by posteriori SNR
Zhang, Xueying
Ren, Yongmei
Jia, Hairong
Zhongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Central South University (Science and Technology), 2013, 44 (11): : 4566 - 4573
[32] Anomaly detection based on maximum a posteriori
Li, Shifeng
Liu, Chunxiao
Yang, Yuqiang
PATTERN RECOGNITION LETTERS, 2018, 107 : 91 - 97
[33] Constrained Structural Maximum A Posteriori Linear Regression for Average-Voice-Based Speech Synthesis
Nakano, Yuji
Tachibana, Makoto
Yamagishi, Junichi
Kobayashi, Takao
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2286 - 2289
[34] Maximum a posteriori based adaptive algorithms
Huang, Dong-Yan
Rahardja, Susanto
Huang, Haibin
CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 1628 - 1632
[35] NOISE POWER SPECTRUM ESTIMATION BASED ON WEAK SPEECH PROTECTION FOR SPEECH ENHANCEMENT
Feng, Yan
An, Baokun
2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 484 - 487
[36] Double Adversarial Network based Monaural Speech Enhancement for Robust Speech Recognition
Du, Zhihao
Han, Jiqing
Zhang, Xueliang
INTERSPEECH 2020, 2020, : 309 - 313
[37] Speech enhancement based on second order architecture and maximum entropy algorithm
Yu, X
Hu, GR
ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 572 - 575
[38] Speech enhancement based on adaptive wavelet denoising on multitaper spectrum
Hsung, Tai-Chiu
Lun, Daniel Pak-Kong
PROCEEDINGS OF 2008 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-10, 2008, : 1700 - 1703
[39] Speech enhancement based on multitaper spectrum and psychoacoustical weighting rule
WU Hongwei WU Zhenyang ZHAO Li ( College of Information Science and Engineering
Chinese Journal of Acoustics, 2007, (03) : 278 - 288
[40] Speech Enhancement Based on Noise-Compensated Phase Spectrum
Islam, Md. T.
Shahnaz, C.
2014 1ST INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION & COMMUNICATION TECHNOLOGY (ICEEICT 2014), 2014,

← 1 2 3 4 5 →