β-Masking MMSE Speech Enhancement for Speech Recognition

Cited by: 0

Authors:

You, Chang Huai ^{[1]}

Ma, Bin ^{[1]}

Affiliations:

[1] ASTAR, Inst Infocomm Res I2R, Human Language Technol, 1 Fusionopolis Way, Singapore, Singapore

Source:

2017 IEEE 2ND INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP) | 2017

Keywords:

speech enhancement; speech recognition; masking threshold; SPECTRAL AMPLITUDE ESTIMATOR; NOISE;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Speech recognition performance deteriorates drastically when recognizers are deployed in practical situations where the speech is corrupted by additive noise. One way to improve the robustness of a speech recognition system is to enhance the speech prior to recognition. This paper focuses on developing a masking-based β-order minimum mean square error (β-masking MMSE) speech enhancement method for speech recognition under noisy conditions. To address the artifacts introduced by the enhancement algorithm and the noise remaining after denoising, we modify the spectral parameter estimation of the β-masking MMSE by controlling the power of the processing noise, strengthening weak-signal processing, oversuppressing the residual noise, and re-estimating the a priori SNR. The evaluation shows that the proposed enhancement scheme significantly improves the performance of a state-of-the-art speech recognition system.


Pages: 341-345

Page count: 5
