β-Masking MMSE Speech Enhancement for Speech Recognition

被引:0
|
作者
You, Chang Huai [1 ]
Ma, Bin [1 ]
机构
[1] ASTAR, Inst Infocomm Res I2R, Human Language Technol, 1 Fusionopolis Way, Singapore, Singapore
关键词
speech enhancement; speech recognition; masking threshold; SPECTRAL AMPLITUDE ESTIMATOR; NOISE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech recognition performance deteriorates drastically when they are deployed in practical situation where the speech is corrupted by additive noise. One way to improve the robustness of the speech recognition system is to enhance the speech prior to its recognition. This paper focuses on developing a masking-based ss-order minimum mean square error (ss-masking MMSE) speech enhancement for speech recognition under noise condition. Addressing the artifacts introduced by enhancement algorithm and the remaining noise after denoising, we modified the estimation algorithm of spectral parameters for the ss-masking MMSE by controlling the power of processing noise, strengthening the weak signal processing, oversuppressing the residual noise and reestimating a priori SNR. The evaluation shows the proposed enhancement scheme is significantly effective to improve the performance of state-of-the-art speech recognition.
引用
收藏
页码:341 / 345
页数:5
相关论文
共 50 条
  • [1] An MMSE speech enhancement approach incorporating masking properties
    You, CH
    Koh, SN
    Rahardja, S
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 725 - 728
  • [2] Masking-based β-order MMSE speech enhancement
    You, CH
    Koh, SN
    Rahardja, S
    [J]. SPEECH COMMUNICATION, 2006, 48 (01) : 57 - 70
  • [3] A LSA-MMSE speech enhancement approach incorporating masking properties
    Chen, Qi
    Guo, Ying
    Duan, Yanli
    Wang, Bo
    [J]. 2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006, : 455 - +
  • [4] Speech Enhancement Based on Masking Approach Considering Speech Quality and Acoustic Confidence for Noisy Speech Recognition
    Chu, Shih-Chuan
    Wu, Chung-Hsien
    Lin, Yun-Wen
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 536 - 540
  • [5] Post-processing in masking-based β-order MMSE speech enhancement
    Zhang, Xinxin
    Koh, Soo Ngee
    Soon, Ing Yann
    You, Changhuai
    [J]. APPLIED ACOUSTICS, 2008, 69 (04) : 354 - 357
  • [6] An Improved MMSE-LSA speech enhancement algorithm based on human auditory masking property
    Zhang, Yong
    Liu, Yi
    [J]. 2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 151 - 154
  • [7] Combining MMSE enhancement with LA model adaptation for robust automatic speech recognition
    Ding, P
    Cao, ZG
    [J]. ELECTRONICS LETTERS, 2001, 37 (08) : 539 - 540
  • [8] Adaptive β-order MMSE estimation for speech enhancement
    You, CH
    Koh, S
    Rahardja, S
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 900 - 903
  • [9] NETWORKS FOR SPEECH ENHANCEMENT AND AUTOMATIC SPEECH RECOGNITION
    Vu, Thanh T.
    Bigot, Benjamin
    Chng, Eng Siong
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 499 - 503
  • [10] SPEECH ENHANCEMENT FOR TELEPHONY NAME SPEECH RECOGNITION
    You, Chang Huai
    Rahardja, Susanto
    Li, Haizhou
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 973 - 976