Approximate Entropy and Empirical Mode Decomposition for Improved Speaker Recognition

Cited by: 1
Authors
Metzger, Richard A. [1, 3]
Doherty, John F. [1]
Jenkins, David M. [2]
Hall, Donald L. [2]
Affiliations
[1] Penn State Univ, Sch Elect Engn & Comp Sci, University Pk, PA 16802 USA
[2] Penn State Univ, Appl Res Lab, University Pk, PA 16802 USA
[3] SAIC, Reston, VA USA
Keywords
Empirical mode decomposition; approximate entropy; speaker identification; speech processing; noise
DOI
10.1142/S2424922X20500114
Chinese Library Classification number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
When processing real-world recordings of speech, it is highly probable that noise will be present at some point in the signal. The problem is compounded when the noise occurs in short, impulsive bursts at random intervals. Traditional signal processing methods for detecting speech rely on the spectral energy of the incoming signal to determine whether a segment contains speech. However, when noise is present, this simple energy detection is prone to falsely flagging noise as speech. This paper demonstrates an alternative way of processing a noisy speech signal that combines information-theoretic and signal processing principles to differentiate speech segments from noise. This preprocessing technique allows a speaker recognition system to train statistical speaker models on noise-corrupted speech files and to construct models statistically similar to those built from noise-free data. The preprocessing method is shown to outperform traditional spectrum-based methods for both low-entropy and high-entropy noise in low signal-to-noise ratio environments, reducing the feature space distortion as measured by the Cauchy-Schwarz (CS) distance metric.
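The abstract does not spell out how the information-theoretic measure is computed, so the following is only a minimal sketch of approximate entropy in its standard textbook form, not the authors' implementation. The function names, the tolerance convention r = 0.2 times the standard deviation, and the two test signals are illustrative assumptions. The point it shows is that a regular, predictable signal yields a lower approximate entropy than white noise, which is the kind of contrast an entropy-based detector can exploit where an energy detector cannot.

```python
import math
import random


def approximate_entropy(x, m=2, r=0.2):
    """Approximate entropy ApEn(m, r) of a sequence (standard definition).

    r is an absolute tolerance. A common convention (an assumption here,
    not taken from the paper) is r = 0.2 * standard deviation of x.
    """
    n = len(x)

    def phi(dim):
        # All overlapping template vectors of length `dim`.
        templates = [x[i:i + dim] for i in range(n - dim + 1)]
        total = 0.0
        for a in templates:
            # Count templates within Chebyshev distance r. The self-match is
            # included, so the count is at least 1 and the log is defined.
            matches = sum(
                1 for b in templates
                if max(abs(u - v) for u, v in zip(a, b)) <= r
            )
            total += math.log(matches / len(templates))
        return total / len(templates)

    return phi(m) - phi(m + 1)


def std(x):
    """Population standard deviation, used only to scale the tolerance."""
    mean = sum(x) / len(x)
    return math.sqrt(sum((v - mean) ** 2 for v in x) / len(x))


# Illustrative signals (hypothetical, not data from the paper).
rng = random.Random(0)
tone = [math.sin(2 * math.pi * 0.05 * k) for k in range(300)]
noise = [rng.gauss(0.0, 1.0) for _ in range(300)]

apen_tone = approximate_entropy(tone, m=2, r=0.2 * std(tone))
apen_noise = approximate_entropy(noise, m=2, r=0.2 * std(noise))
```

In this sketch the pairwise comparison is quadratic in the sequence length, which is acceptable for short frames; a practical system would likely apply it per frame and threshold the result, with the threshold chosen empirically.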
Pages: 24