Endpoint detection of noisy speech based on cepstrum

被引:0
|
作者
Hu, Guangrui [1 ]
Wei, Xiaodong [1 ]
机构
[1] Shanghai Jiaotong Univ, Shanghai, China
来源
关键词
Algorithms - Markov processes - Models - Signal detection - Signal to noise ratio - Spurious signal noise;
D O I
暂无
中图分类号
学科分类号
摘要
A major cause of errors in automatic speech recognition (ASR) systems is the inaccurate detection of the beginning and ending boundaries of test and reference patterns. Accurate determination of endpoints of speech is not very difficult if the SNR is high. Unfortunately, most practical ASR systems must work with a small SNR, and the conventional speech detection methods based on some simple features, such as energy cannot work well in noisy environments. In this paper, cepstrum is used as the feature to detect the voice activity. Two algorithms for endpoint detection of noisy speech signal are proposed. The first one takes the cepstral distance as the decision thresholds instead of short-time energy. The second approach modified the HMM-based speech detector to make it adaptive to the change of noise. The experiments show that high accurate rates can be obtained.
引用
收藏
页码:95 / 97
相关论文
共 50 条
  • [21] Robust Speech Endpoint Detection in Noisy Environments for HRI (Human-Robot Interface)
    Park, Jin-Soo
    Ko, Han-Seok
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2013, 32 (02): : 147 - 156
  • [22] Endpoint detection of isolated Korean utterances for bimodal speech recognition in acoustic noisy environments
    Oh, HH
    Kwon, HS
    Son, JM
    Bae, KS
    Chien, SI
    FOUNDATIONS OF INTELLIGENT SYSTEMS, 2003, 2871 : 585 - 592
  • [23] Cepstrum third-order normalisation method for noisy speech recognition
    Suk, YH
    Choi, SH
    Lee, HS
    ELECTRONICS LETTERS, 1999, 35 (07) : 527 - 528
  • [24] Speech endpoint detection based on recurrence rate analysis
    Yan, Run-Qiang
    Zhu, Yi-Sheng
    Tongxin Xuebao/Journal on Communications, 2007, 28 (01): : 35 - 39
  • [25] A Speech Endpoint Detection Algorithm Based on Wavelet Transforms
    Cao Yali
    La Dongsheng
    Jia Shuo
    Niu Xuefen
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 3010 - 3012
  • [26] Speech Endpoint Detection with Low SNR Based on HHTSM
    Liu Baisen
    Zhang Ye
    Zhang Wulin
    PROCEEDINGS OF 2013 IEEE 11TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS (ICEMI), 2013, : 116 - 119
  • [27] Cepstrum third-order normalization method for noisy speech recognition
    Department of Electrical Engineering, Korea Adv. Inst. Sci. and Technol., 373-1 Kusong-Dong, Yusong-Gu, Taejon 305-701, Korea, Republic of
    不详
    不详
    Electron. Lett., 7 (527-528):
  • [28] An improved speech endpoint detection system in noisy environments by means of third-order spectra
    Navarro-Mesa, J
    Moreno-Bilbao, A
    Lleida-Solano, E
    IEEE SIGNAL PROCESSING LETTERS, 1999, 6 (09) : 224 - 226
  • [29] Speech Endpoint Detection Using Gradient Based Edge Detection Techniques
    Ghaemmaghami, Houman
    Vogt, Robert
    Sridharan, Sridha
    Mason, Michael
    ICSPCS: 2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, PROCEEDINGS, 2008, : 373 - 380
  • [30] SVM-based speech endpoint detection using contextual speech features
    Ramirez, J.
    Yelamos, R.
    Gorriz, J. M.
    Segura, J. C.
    ELECTRONICS LETTERS, 2006, 42 (07) : 426 - 428