Speech Enhancement Method with Geometric Phase Estimation By Incorporating MIXMAX Model

被引：1

作者：

Wang, Xianyun ^{[1
]}

Bao, Changchun ^{[1
]}

机构：

[1] Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

来源：

2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA) | 2016年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/APSIPA.2016.7820908

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we propose a frequency-domain speech enhancement algorithm with phase estimation, in which the speech model is modeled by a Gaussian mixture model (GMM) in the log-spectral domain and two closed-form log-spectral amplitude estimators for speech and noise are derived directly by using a Mixture-Maximum (MIXMAX) model. Because the accurate estimation of speech phase could help to reduce the undesired noise residues in the enhanced signal, our two log-spectral estimators are also used to construct a geometric approach for phase estimation in each frequency bin. In order to solve the ambiguity problem in phase estimation, we utilize the complex linear predictive analysis (CLPA) and inconsistency constraint to find an appropriate phase. Experimental results show that, in comparison with the reference methods, the proposed method achieves an efficient improvement in speech quality.

引用

页数：4

共 50 条

[21] Relaxed statistical model for speech enhancement and a priori SNR estimation
Cohen, I
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 870 - 881
[22] Incorporating Broad Phonetic Information for Speech Enhancement
Lu, Yen-Ju
Liao, Chien-Feng
Lu, Xugang
Hung, Jeih-weih
Tsao, Yu
[J]. INTERSPEECH 2020, 2020, : 2417 - 2421
[23] Incorporating Symbolic Sequential Modeling for Speech Enhancement
Liao, Chien-Feng
Tsao, Yu
Lu, Xugang
Kawai, Hisashi
[J]. INTERSPEECH 2019, 2019, : 2733 - 2737
[24] ON PHASE IMPORTANCE IN PARAMETER ESTIMATION IN SINGLE-CHANNEL SPEECH ENHANCEMENT
Mowlaee, Pejman
Saeidi, Rahim
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7462 - 7466
[25] Microphone array speech enhancement by Bayesian estimation of spectral amplitude and phase
Balan, R
Rosca, J
[J]. SAM2002: IEEE SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP PROCEEDINGS, 2002, : 209 - 213
[26] Speech Enhancement by Short-Time Spectrum Estimation with Multivariate Laplace Speech Model
Zhou, Bin
Zhang, Xiongwei
Zou, Xia
Zhao, Gaihua
[J]. PRZEGLAD ELEKTROTECHNICZNY, 2012, 88 (12A): : 338 - 342
[27] PHASE ESTIMATION IN SINGLE-CHANNEL SPEECH ENHANCEMENT USING PHASE INVARIANCE CONSTRAINTS
Pirolt, Michael
Stahl, Johannes
Mowlaee, Pejman
Vorobiov, Vasili I.
Barysenka, Siarhei Y.
Davydov, Andrew G.
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5585 - 5589
[28] A Data Field method for speech enhancement incorporating Binary Time-Frequency Masking
Huang, Jianjun
Zhang, Yafei
Zhang, Xiongwei
Zhu, Tao
[J]. PRZEGLAD ELEKTROTECHNICZNY, 2011, 87 (07): : 225 - 229
[29] Speech enhancement using the modified phase-opponency model
Deshmukh, Om D.
Espy-Wilson, Carol Y.
Carney, Laurel H.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (06): : 3886 - 3898
[30] Recursive estimation based on the trended hidden Markov model in speech enhancement
Lee, KY
Rheem, JY
Shirai, K
[J]. APCCAS '96 - IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS '96, 1996, : 239 - 242

← 1 2 3 4 5 →