Perceptual speech enhancement exploiting temporal masking properties of human auditory system

被引：15

作者：

Gunawan, Teddy Surya ^{[1
]}

Ambikairajah, Eliathamby ^{[2
]}

Epps, Julien ^{[2
]}

机构：

[1] Int Islamic Univ Malaysia, Dept Elect & Comp Engn, Kuala Lumpur 53100, Malaysia

[2] Univ New S Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia

来源：

SPEECH COMMUNICATION | 2010年 / 52卷 / 05期

关键词：

Human auditory system; Speech enhancement; Temporal masking; Subjective test; Objective test; NOISE; MODEL; SUPPRESSION; FREQUENCY; PHASE;

D O I：

10.1016/j.specom.2009.12.006

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The use of simultaneous masking in speech enhancement has shown promise for a range of noise types. In this paper, a new speech enhancement algorithm based on a short-term temporal masking threshold to noise ratio (MNR) is presented. A novel functional model for forward masking based on three parameters is incorporated into a speech enhancement framework based on speech boosting. The performance of the speech enhancement algorithm using the proposed forward masking model was compared with seven other speech enhancement methods over 12 different noise types and four SNRs. Objective evaluation using PESQ revealed that using the proposed forward masking model, the speech enhancement algorithm outperforms the other algorithms by 6-20% depending on the SNR. Moreover, subjective evaluation using 16 listeners confirmed the objective test results. (C) 2009 Elsevier B.V. All rights reserved.

引用

页码：381 / 393

页数：13

共 50 条

[1] A single channel speech enhancement technique exploiting human auditory masking properties
Nsabimana, F. X.
Subbaraman, V
Zoelzer, U.
[J]. ADVANCES IN RADIO SCIENCE, 2010, 8 : 95 - 99
[2] DCT Speech Enhancement Based on Masking Properties of Human Auditory System
Li Yang
Li Shuangtian
[J]. 2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 450 - 453
[3] Single channel speech enhancement based on masking properties of the human auditory system
Virag, N
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (02): : 126 - 137
[4] Improved method for speech enhancement based on human auditory masking properties
School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
[J]. Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2008, 37 (02): : 255 - 257
[5] Millimeter wave conduct speech enhancement based on auditory masking properties
Li, S.
Wang, J. Q.
Niu, M.
Liu, T.
Jing, X. J.
[J]. MICROWAVE AND OPTICAL TECHNOLOGY LETTERS, 2008, 50 (08) : 2109 - 2114
[6] A Modified Spectral Subtraction Method for Speech Enhancement Based on Masking Property of Human Auditory System
Xia, Bing-yin
Liang, Yan
Bao, Chang-chun
[J]. 2009 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2009), 2009, : 942 - 946
[7] Enhancement of electrolarynx speech based on auditory masking
Liu, HJ
Zhao, Q
Wan, MX
Wang, SP
[J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2006, 53 (05) : 865 - 874
[8] Speech enhancement using perceptual wavelet thresholding with the Ephraim and Malah noise suppressor and auditory masking
Parajuli, Ashish
DeBrunner, Victor
[J]. 2005 39th Asilomar Conference on Signals, Systems and Computers, Vols 1 and 2, 2005, : 301 - 304
[9] An Optimal Speech Enhancement under Speech Uncertainty Probability and Masking Property of Auditory System
Huang, Xiaoshan
Zhao, Xiaoqun
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 373 - +
[10] Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system
Hansen, John H. L.
Radhakrishnan, Vinod
Arehart, Kathryn Hoberg
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): : 2049 - 2063

← 1 2 3 4 5 →