Speech endpoint identification based on empirical mode decomposition

Cited by: 1
Authors
Yao, Zhen-Jie [1]
Huang, Hai [1]
Chen, Xiang-Xian [1]
Affiliations
[1] Department of Instrumentation Science and Engineering, Zhejiang University, Hangzhou 310027, China
Keywords
Acoustic noise - Speech recognition - Numerical methods - Speech communication - Audio signal processing - Intrinsic mode functions
DOI
10.3785/j.issn.1008-973X.2009.04.019
Abstract
A new method based on empirical mode decomposition (EMD) was proposed to identify the endpoints of speech segments in noise-contaminated speech signals. Noisy speech signals were decomposed into a set of intrinsic mode functions (IMFs) using EMD. The average instantaneous frequency of each IMF was estimated from its short-time zero-crossing rate. Based on the characteristics of the average instantaneous frequencies of IMFs derived from speech signals, frames with low and slowly changing average instantaneous frequencies were identified as periodic voiced (sonant) segments, and frames with high average instantaneous frequencies were identified as unvoiced (surd) segments. The final speech signals were obtained by processing and combining these segments. Numerical and experimental results show that the method can effectively identify endpoints in speech that is heavily contaminated by noise.
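The frequency-estimation and frame-labelling steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the EMD step has already produced a single IMF as a list of samples, and the function names, frame sizes, and thresholds (`low_hz`, `high_hz`, `max_jump_hz`) are hypothetical values chosen only for illustration. The estimate relies on the fact that a sinusoid crosses zero twice per period.

```python
import math


def zero_crossing_frequency(frame, sample_rate):
    """Estimate a frame's average frequency (Hz) from its zero-crossing count."""
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    duration = (len(frame) - 1) / sample_rate  # seconds spanned by the frame
    # A sinusoid crosses zero twice per period, hence the factor of 2.
    return crossings / (2.0 * duration) if duration > 0 else 0.0


def frame_frequencies(signal, sample_rate, frame_len, hop):
    """Slide a window over one IMF and estimate a frequency for each frame."""
    return [
        zero_crossing_frequency(signal[i:i + frame_len], sample_rate)
        for i in range(0, len(signal) - frame_len + 1, hop)
    ]


def label_frames(freqs, low_hz, high_hz, max_jump_hz):
    """Label frames using hypothetical thresholds (assumptions, not from the paper).

    Low and slowly changing frequency -> "voiced"; high frequency -> "unvoiced".
    """
    labels = []
    prev = None
    for f in freqs:
        if f >= high_hz:
            labels.append("unvoiced")
        elif f <= low_hz and (prev is None or abs(f - prev) <= max_jump_hz):
            labels.append("voiced")
        else:
            labels.append("other")
        prev = f
    return labels


def sine(freq_hz, sample_rate, n_samples, phase=0.5):
    """Synthetic stand-in for an IMF: a pure tone with a small phase offset."""
    return [
        math.sin(2.0 * math.pi * freq_hz * n / sample_rate + phase)
        for n in range(n_samples)
    ]


if __name__ == "__main__":
    rate = 8000
    # A low tone followed by a high tone, standing in for voiced then unvoiced.
    imf = sine(100, rate, 1600) + sine(3000, rate, 800)
    freqs = frame_frequencies(imf, rate, frame_len=800, hop=800)
    print(label_frames(freqs, low_hz=500, high_hz=2000, max_jump_hz=50))
```

In practice the thresholds would need to be tuned per IMF and per noise condition, and the labelled segments from several IMFs would still have to be combined, as the abstract notes.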
Pages: 705-709