Speech Stream Detection for Noisy Environments Based on Empirical Mode Decomposition

被引:0
|
作者
Tang Qiang [1 ]
Zhang Dexiang [1 ,2 ]
Yan Qing [2 ]
机构
[1] Anhui Univ, Key Lab Intelligent Comp & Signal Proc, Hefei 230039, Peoples R China
[2] Anhui Univ, Sch Elect Engn & Automat, Hefei 230601, Peoples R China
基金
美国国家科学基金会;
关键词
Speech stream detection; EMD; spectral entropy; Teager energy;
D O I
10.4028/www.scientific.net/AMM.397-400.2239
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A new approach for speech stream detection based on empirical mode decomposition (EMD) under a noisy environment is proposed. Accurate speech stream detection proves to significantly improve speech recognition performance under noise. The proposed algorithm relies on the Teager energy and spectral entropy characteristics of the signal to determine whether an input frame is speech or non-speech. Firstly, the noise signals can be decomposed into different numbers of sub-signals called intrinsic mode functions (IMFs) with the EMD. Then, spectral entropy is used to extract the desired feature for noisy IMF components and Teager energy is used to non-noisy IMF components. Finally, in order to show the effectiveness of the proposed method, we present examples showing that the new measure is more effective than traditional measures. The experiments show that the proposed algorithm can suppress different noise types with different SNR.
引用
收藏
页码:2239 / +
页数:2
相关论文
共 50 条
  • [41] Accent Extraction of Emotional Speech based on Modified Ensemble Empirical Mode Decomposition
    Shen, Zhiyuan
    Wang, Qiang
    Shen, Yi
    Jin, Jing
    Lin, Yurong
    [J]. 2010 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE I2MTC 2010, PROCEEDINGS, 2010,
  • [42] A robust endpoint detection of speech for noisy environments with application to automatic speech recognition
    Bou-Ghazale, SE
    Assaleh, K
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 3808 - 3811
  • [43] Research of the Speech Signal Reconstruction at Empirical Mode Decomposition
    Priorov, A. L.
    Pavlovichev, P. O.
    Khryashchev, V. V.
    [J]. 2017 IEEE EAST-WEST DESIGN & TEST SYMPOSIUM (EWDTS), 2017,
  • [44] SPEECH ENHANCEMENT USING ADAPTIVE EMPIRICAL MODE DECOMPOSITION
    Chatlani, Navin
    Soraghan, John J.
    [J]. 2009 16TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 417 - 422
  • [45] A better decomposition of speech obtained using modified Empirical Mode Decomposition
    Sharma, Rajib
    Prasanna, S. R. Mahadeva
    [J]. DIGITAL SIGNAL PROCESSING, 2016, 58 : 26 - 39
  • [46] A speech stream detection in adverse acoustic environments based on cross correlation technique
    Zhang, Ru-bo
    Wu, Tian
    Li, Xue-yao
    Xu, Dong
    [J]. ADVANCES IN NATURAL COMPUTATION, PT 2, 2006, 4222 : 664 - 667
  • [47] A novel algorithm to robust speech endpoint detection in noisy environments
    Yi, Li
    Yingle, Fan
    [J]. ICIEA 2007: 2ND IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-4, PROCEEDINGS, 2007, : 1555 - 1558
  • [48] Extraction of time varying information from noisy signals: An approach based on the empirical mode decomposition
    Li, Chen
    Wang, Xinlong
    Tao, Zhiyong
    Wang, Qingfu
    Du, Shuanping
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2011, 25 (03) : 812 - 820
  • [49] A recognition–verification system for noisy faces based on an empirical mode decomposition with Green’s functions
    Saad Al-Baddai
    Pere Marti-Puig
    Esteve Gallego-Jutglà
    Karema Al-Subari
    Ana Maria Tomé
    Bernd Ludwig
    Elmar Wolfgang Lang
    Jordi Solé-Casals
    [J]. Soft Computing, 2020, 24 : 3809 - 3827
  • [50] Speech Emotion Recognition Based on EMD in Noisy Environments
    Chu, Yunyun
    Xiong, Weihua
    Chen, Wei
    [J]. ADVANCES IN CIVIL ENGINEERING AND BUILDING MATERIALS III, 2014, 831 : 460 - 464