Impulsive Noise Detection for Speech Enhancement in HHT Domain

被引:2
|
作者
Medina, C. [1 ]
Coelho, R. [1 ]
Zao, L. [1 ]
机构
[1] Mil Inst Engn IME, Lab Acoust Signal Proc, BR-22290270 Rio De Janeiro, RJ, Brazil
关键词
Speech enhancement; Noise measurement; Acoustic noise; Indexes; Time-domain analysis; Estimation; Signal to noise ratio; impulsive noises; Hilbert-Huang transform; non-stationary acoustic noises; INTELLIGIBILITY IMPROVEMENT; ACOUSTIC NOISE;
D O I
10.1109/TASLP.2021.3093392
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces a novel single channel speech enhancement method in the time domain to mitigate the effects of acoustic impulsive noises. The ensemble empirical mode decomposition is applied to analyze the noisy speech signal. The estimation and selection of noise components is based on the impulsiveness index of decomposition modes. An adaptive threshold is proposed to define the criterion to select the noise components. The proposed method is evaluated in speech enhancement experiments considering four acoustic noises with different impulsiveness indices and non-stationarity degrees under various signal-to-noise ratios. Four speech enhancement algorithms are adopted as baseline in the evaluation analysis considering spectral and time domains. Seven objective measures are adopted to compare the proposed and baseline approaches in terms of speech quality and intelligibility. Results show that the proposed solution outperforms the competing algorithms for most of the noisy scenarios. The novel method shows particularly interesting performance when speech signals are corrupted by highly impulsive acoustic noises.
引用
收藏
页码:2244 / 2253
页数:10
相关论文
共 50 条
  • [21] Noise robust speech recognition by combining speech enhancement in the wavelet domain and Lin-log RASTA
    Yang Jie
    Wang Zhenli
    2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL II, 2009, : 415 - +
  • [22] Interactive Speech and Noise Modeling for Speech Enhancement
    Zheng, Chengyu
    Peng, Xiulian
    Zhang, Yuan
    Srinivasan, Sriram
    Lu, Yan
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14549 - 14557
  • [23] Speech Enhancement Based on 4th Order Cumulant Backward Linear Predictor for Impulsive Noise
    Sasaoka, Naoto
    Ono, Kazumasa
    Itoh, Yoshio
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 127 - 131
  • [24] Impulsive noise removal image enhancement technique
    Mohapatra, Subrajeet
    Sa, Pankaj Kumar
    Majhi, Banshidhar
    PROCEEDINGS OF THE WSEAS INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, ELECTRONICS, CONTROL & SIGNAL PROCESSING: SELECTED TOPICS ON CIRCUITS, SYSTEMS, ELECTRONICS, CONTROL & SIGNAL PROCESSING, 2007, : 317 - 322
  • [25] Evolutionary Image Enhancement for Impulsive Noise Reduction
    Cho, Ung-Keun
    Hong, Jin-Hyuk
    Cho, Sung-Bae
    INTELLIGENT COMPUTING, PART I: INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, ICIC 2006, PART I, 2006, 4113 : 678 - 683
  • [26] SPEECH DETECTION IN NOISE
    VENEKLASEN, PS
    CHRISTOFF, JP
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1960, 32 (11): : 1502 - 1502
  • [27] Speech enhancement based on dynamic noise estimation within auto-correlation domain
    Wu, YD
    Wu, XH
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 281 - 284
  • [28] SPEECH ENHANCEMENT BASED ON DYNAMIC NOISE ESTIMATION WITHIN AUTO-CORRELATION DOMAIN
    吴亚栋
    吴旭辉
    Journal of Shanghai Jiaotong University(Science), 2002, (02) : 211 - 214
  • [29] A cepstrum domain HMM-based speech enhancement method applied to nonstationary noise
    Nilsson, M
    Dahl, M
    Claesson, I
    SIGNAL PROCESSING FOR TELECOMMUNICATIONS AND MULTIMEDIA, 2005, 27 : 1 - 13
  • [30] Robust treatment of impulsive noise in speech and audio signals
    Godsill, SJ
    Rayner, PJW
    BAYESIAN ROBUSTNESS, 1996, 29 : 331 - 342