Perceptual speech enhancement exploiting temporal masking properties of human auditory system

Cited by: 15
Authors
Gunawan, Teddy Surya [1]
Ambikairajah, Eliathamby [2]
Epps, Julien [2]
Affiliations
[1] Int Islamic Univ Malaysia, Dept Elect & Comp Engn, Kuala Lumpur 53100, Malaysia
[2] Univ New S Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia
Keywords
Human auditory system; Speech enhancement; Temporal masking; Subjective test; Objective test; NOISE; MODEL; SUPPRESSION; FREQUENCY; PHASE;
DOI
10.1016/j.specom.2009.12.006
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The use of simultaneous masking in speech enhancement has shown promise for a range of noise types. In this paper, a new speech enhancement algorithm based on a short-term temporal masking threshold to noise ratio (MNR) is presented. A novel functional model for forward masking based on three parameters is incorporated into a speech enhancement framework based on speech boosting. The performance of the speech enhancement algorithm using the proposed forward masking model was compared with seven other speech enhancement methods over 12 different noise types and four SNRs. Objective evaluation using PESQ revealed that using the proposed forward masking model, the speech enhancement algorithm outperforms the other algorithms by 6-20% depending on the SNR. Moreover, subjective evaluation using 16 listeners confirmed the objective test results. (C) 2009 Elsevier B.V. All rights reserved.
Pages: 381-393
Page count: 13
Related papers
50 in total
  • [1] A single channel speech enhancement technique exploiting human auditory masking properties
    Nsabimana, F. X.
    Subbaraman, V
    Zoelzer, U.
    [J]. ADVANCES IN RADIO SCIENCE, 2010, 8 : 95 - 99
  • [2] DCT Speech Enhancement Based on Masking Properties of Human Auditory System
    Li Yang
    Li Shuangtian
    [J]. 2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 450 - 453
  • [3] Single channel speech enhancement based on masking properties of the human auditory system
    Virag, N
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (02): 126 - 137
  • [4] Improved method for speech enhancement based on human auditory masking properties
    School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
    [J]. Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2008, 37 (02): 255 - 257
  • [5] Millimeter wave conduct speech enhancement based on auditory masking properties
    Li, S.
    Wang, J. Q.
    Niu, M.
    Liu, T.
    Jing, X. J.
    [J]. MICROWAVE AND OPTICAL TECHNOLOGY LETTERS, 2008, 50 (08) : 2109 - 2114
  • [6] A Modified Spectral Subtraction Method for Speech Enhancement Based on Masking Property of Human Auditory System
    Xia, Bing-yin
    Liang, Yan
    Bao, Chang-chun
    [J]. 2009 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2009), 2009, : 942 - 946
  • [7] Enhancement of electrolarynx speech based on auditory masking
    Liu, HJ
    Zhao, Q
    Wan, MX
    Wang, SP
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2006, 53 (05) : 865 - 874
  • [8] Speech enhancement using perceptual wavelet thresholding with the Ephraim and Malah noise suppressor and auditory masking
    Parajuli, Ashish
    DeBrunner, Victor
    [J]. 2005 39th Asilomar Conference on Signals, Systems and Computers, Vols 1 and 2, 2005, : 301 - 304
  • [9] An Optimal Speech Enhancement under Speech Uncertainty Probability and Masking Property of Auditory System
    Huang, Xiaoshan
    Zhao, Xiaoqun
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 373 - +
  • [10] Speech enhancement based on generalized minimum mean square error estimators and masking properties of the auditory system
    Hansen, John H. L.
    Radhakrishnan, Vinod
    Arehart, Kathryn Hoberg
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06): 2049 - 2063