Speech Enhancement Based on Minima Controlled Recursive Averaging Technique Incorporating Conditional MAP

被引:0
|
作者
Kum, Jong-Mo
Park, Yun-Sik
Chang, Joon-Hyuk
机构
来源
关键词
minima controlled recursive averaging (MCRA); conditional maximum a posteriori (Conditional MAP);
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a novel approach to improve the performance of minima controlled recursive averaging (MCRA) which is based on the conditional maximum a posteriori criterion, A crucial component of a practical speech enhancement system is the estimation of the noise power spectrum, One state-of-the-art approach is the minima controlled recursive averaging (MCRA) technique, The noise estimate in the MCRA technique is obtained by averaging past spectral power values based on a smoothing parameter that is adjusted by the signal presence probability in frequency subbands. We improve the MCRA using the speech presence probability which is the a posteriori probability conditioned on both the current observation the speech presence or absence of the previous frame, With the performance criteria of the ITU-T P, 862 perceptual evaluation of speech quality (PESQ) and subjective evaluation of speech quality, we show that the proposed algorithm yields better results compared to the conventional MCRA-based scheme.
引用
收藏
页码:256 / 261
页数:6
相关论文
共 50 条
  • [41] ICA-based MAP speech enhancement with multiple variable speech distribution models
    Zou, Xin
    Jancovic, Peter
    Kokuer, Muenevver
    Russell, Martin
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 415 - 418
  • [42] EMD based Clear Recursive Thresholding (EMD-CRT) for speech enhancement
    Saggurti, Nageswara Rao
    Shankar, Jaya
    2015 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTING AND CONTROL (ISPCC), 2015, : 149 - 154
  • [43] Single-Channel Speech Enhancement With Phase Reconstruction Based on Phase Distortion Averaging
    Wakabayashi, Yukoh
    Fukumori, Takahiro
    Nakayama, Masato
    Nishiura, Takanobu
    Yamashita, Yoichi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1559 - 1569
  • [44] Incorporating group update for speech enhancement based on convolutional gated recurrent network
    Yuan, Wenhao
    SPEECH COMMUNICATION, 2021, 132 : 32 - 39
  • [45] INCORPORATING REAL-WORLD NOISY SPEECH IN NEURAL-NETWORK-BASED SPEECH ENHANCEMENT SYSTEMS
    Xia, Yangyang
    Xu, Buye
    Kumar, Anurag
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 564 - 570
  • [46] Voice Activity Detection Based on Generalized Normal-Laplace Distribution Incorporating Conditional MAP
    Song, Ji-Hyun
    Lee, Sangmin
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (12) : 2888 - 2891
  • [47] Directed Searching Optimization-Based Speech Enhancement Technique
    Kumar, Sandeep
    FLUCTUATION AND NOISE LETTERS, 2020, 19 (04):
  • [48] Single Channel Speech Enhancement using a 9 Dimensional Noise Estimation Algorithm and Controlled Forward March Averaging
    Farrokhi, Dariush
    Togneri, Roberto
    Zaknich, Anthony
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 17 - 21
  • [49] MAP estimation for noisy speech enhancement based on inter-frame correlation
    Ou, Shi-Feng
    Zhao, Xiao-Hui
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2007, 35 (10): : 2007 - 2013
  • [50] A variant of SWEMDH technique based on variational mode decomposition for speech enhancement
    Selvaraj, Poovarasan
    Chandra, E.
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2021, 25 (03) : 299 - 308