Speech Endpoint Detection in Noisy Environments Using EMD and Teager Energy Operator

被引:4
|
作者
De-Xiang Zhang
机构
基金
中国国家自然科学基金;
关键词
Empirical mode decomposition; endpoint detection; noisy speech; Teager energy operator;
D O I
暂无
中图分类号
TN912.34 [语音识别与设备];
学科分类号
0711 ;
摘要
Accurate endpoint detection is a necessary capability for speech recognition. A new energy measure method based on the empirical mode decomposition (EMD) algorithm and Teager energy operator (TEO) is proposed to locate endpoint intervals of a speech signal embedded in noise. With the EMD, the noise signals can be decomposed into different numbers of sub-signals called intrinsic mode functions (IMFs), which is a zero-mean AM-FM component. Then TEO can be used to extract the desired feature of the modulation energy for IMF components. In order to show the effectiveness of the proposed method, examples are presented to show that the new measure is more effective than traditional measures. The present experimental results show that the measure can be used to improve the performance of endpoint detection algorithms and the accuracy of this algorithm is quite satisfactory and acceptable.
引用
收藏
页码:183 / 186
页数:4
相关论文
共 50 条
  • [21] Wavelet speech enhancement based on the Teager Energy operator
    Bahoura, M
    Rouat, J
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (01) : 10 - 12
  • [22] Speech Endpoint Detection in Noisy Environment Using Spectrogram Boundary Factor
    Wu, Di
    Tao, Zhi
    Wu, Yuanbo
    Shen, Cheng
    Xiao, Zhongzhe
    Zhang, Xiaojun
    Wu, Di
    Zhao, Heming
    [J]. 2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 964 - 968
  • [23] Speech enhancement using perceptual wavelet packet decomposition and teager energy operator
    Chen, SH
    Wang, JF
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2004, 36 (2-3): : 125 - 139
  • [24] Speech Enhancement Using Perceptual Wavelet Packet Decomposition and Teager Energy Operator
    Shi-Huang Chen
    Jhing-Fa Wang
    [J]. Journal of VLSI signal processing systems for signal, image and video technology, 2004, 36 : 125 - 139
  • [25] An improved speech endpoint detection system in noisy environments by means of third-order spectra
    Navarro-Mesa, J
    Moreno-Bilbao, A
    Lleida-Solano, E
    [J]. IEEE SIGNAL PROCESSING LETTERS, 1999, 6 (09) : 224 - 226
  • [26] Robust speech endpoint detection based on MP3 file in various noisy environments
    Wang, Fang
    Huang, Xianglin
    Yang, Lifang
    Liu, Tao
    [J]. 2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 670 - 675
  • [27] DATA PROCESSING OF PULSATION SIGNALS OF A TURBINE BASED ON EMD AND TEAGER ENERGY OPERATOR
    Wang, Fengli
    Chen, Hua
    Gu, Aiguo
    Hu, Wei
    [J]. PROCEEDINGS OF THE ASME TURBO EXPO: TURBOMACHINERY TECHNICAL CONFERENCE AND EXPOSITION, 2018, VOL 7A, 2018, : 15 - 24
  • [28] Endpoint detection method of noisy Chinese speech recognition
    Wang, Peng
    Ta, Weina
    Chen, Shuzhong
    [J]. Jisuanji Gongcheng/Computer Engineering, 2003, 29 (17):
  • [29] Robust Speech Detection for Noisy Environments
    Varela, Oscar
    Indra, S. A.
    San-Segundo, Ruben
    Hernandez, Luis A.
    [J]. IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2011, 26 (11) : 16 - U12
  • [30] Speech enhancement using empirical mode decomposition and the Teager-Kaiser energy operator
    Khaldi, Kais
    Boudraa, Abdel-Ouahab
    Komaty, Ali
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (01): : 451 - 459