Speech Enhancement Method with Geometric Phase Estimation By Incorporating MIXMAX Model

被引:1
|
作者
Wang, Xianyun [1 ]
Bao, Changchun [1 ]
机构
[1] Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/APSIPA.2016.7820908
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a frequency-domain speech enhancement algorithm with phase estimation, in which the speech model is modeled by a Gaussian mixture model (GMM) in the log-spectral domain and two closed-form log-spectral amplitude estimators for speech and noise are derived directly by using a Mixture-Maximum (MIXMAX) model. Because the accurate estimation of speech phase could help to reduce the undesired noise residues in the enhanced signal, our two log-spectral estimators are also used to construct a geometric approach for phase estimation in each frequency bin. In order to solve the ambiguity problem in phase estimation, we utilize the complex linear predictive analysis (CLPA) and inconsistency constraint to find an appropriate phase. Experimental results show that, in comparison with the reference methods, the proposed method achieves an efficient improvement in speech quality.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Relaxed statistical model for speech enhancement and a priori SNR estimation
    Cohen, I
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 870 - 881
  • [22] Incorporating Broad Phonetic Information for Speech Enhancement
    Lu, Yen-Ju
    Liao, Chien-Feng
    Lu, Xugang
    Hung, Jeih-weih
    Tsao, Yu
    [J]. INTERSPEECH 2020, 2020, : 2417 - 2421
  • [23] Incorporating Symbolic Sequential Modeling for Speech Enhancement
    Liao, Chien-Feng
    Tsao, Yu
    Lu, Xugang
    Kawai, Hisashi
    [J]. INTERSPEECH 2019, 2019, : 2733 - 2737
  • [24] ON PHASE IMPORTANCE IN PARAMETER ESTIMATION IN SINGLE-CHANNEL SPEECH ENHANCEMENT
    Mowlaee, Pejman
    Saeidi, Rahim
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7462 - 7466
  • [25] Microphone array speech enhancement by Bayesian estimation of spectral amplitude and phase
    Balan, R
    Rosca, J
    [J]. SAM2002: IEEE SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP PROCEEDINGS, 2002, : 209 - 213
  • [26] Speech Enhancement by Short-Time Spectrum Estimation with Multivariate Laplace Speech Model
    Zhou, Bin
    Zhang, Xiongwei
    Zou, Xia
    Zhao, Gaihua
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2012, 88 (12A): : 338 - 342
  • [27] PHASE ESTIMATION IN SINGLE-CHANNEL SPEECH ENHANCEMENT USING PHASE INVARIANCE CONSTRAINTS
    Pirolt, Michael
    Stahl, Johannes
    Mowlaee, Pejman
    Vorobiov, Vasili I.
    Barysenka, Siarhei Y.
    Davydov, Andrew G.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5585 - 5589
  • [28] A Data Field method for speech enhancement incorporating Binary Time-Frequency Masking
    Huang, Jianjun
    Zhang, Yafei
    Zhang, Xiongwei
    Zhu, Tao
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2011, 87 (07): : 225 - 229
  • [29] Speech enhancement using the modified phase-opponency model
    Deshmukh, Om D.
    Espy-Wilson, Carol Y.
    Carney, Laurel H.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (06): : 3886 - 3898
  • [30] Recursive estimation based on the trended hidden Markov model in speech enhancement
    Lee, KY
    Rheem, JY
    Shirai, K
    [J]. APCCAS '96 - IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS '96, 1996, : 239 - 242