Endpoint detection of noisy speech based on cepstrum

被引:0
|
作者
Hu, Guangrui [1 ]
Wei, Xiaodong [1 ]
机构
[1] Shanghai Jiaotong Univ, Shanghai, China
来源
关键词
Algorithms - Markov processes - Models - Signal detection - Signal to noise ratio - Spurious signal noise;
D O I
暂无
中图分类号
学科分类号
摘要
A major cause of errors in automatic speech recognition (ASR) systems is the inaccurate detection of the beginning and ending boundaries of test and reference patterns. Accurate determination of endpoints of speech is not very difficult if the SNR is high. Unfortunately, most practical ASR systems must work with a small SNR, and the conventional speech detection methods based on some simple features, such as energy cannot work well in noisy environments. In this paper, cepstrum is used as the feature to detect the voice activity. Two algorithms for endpoint detection of noisy speech signal are proposed. The first one takes the cepstral distance as the decision thresholds instead of short-time energy. The second approach modified the HMM-based speech detector to make it adaptive to the change of noise. The experiments show that high accurate rates can be obtained.
引用
收藏
页码:95 / 97
相关论文
共 50 条
  • [1] Endpoint detection of noisy speech by the use of cepstrum
    Wei, Xiaodong
    Hu, Guangrui
    Ren, Xiaolin
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2000, 34 (02): : 185 - 188
  • [2] Speech Endpoint Detection Method Based on TEO in Noisy Environment
    Li Jie
    Zhou Ping
    Jing Xinxing
    Du Zhiran
    2012 INTERNATIONAL WORKSHOP ON INFORMATION AND ELECTRONICS ENGINEERING, 2012, 29 : 2655 - 2660
  • [3] Noisy Speech Endpoint Detection Using Robust Feature
    Ouzounov, Atanas
    BIOMETRIC AUTHENTICATION (BIOMET 2014), 2014, 8897 : 105 - 117
  • [4] Endpoint detection method of noisy Chinese speech recognition
    Wang, Peng
    Ta, Weina
    Chen, Shuzhong
    Jisuanji Gongcheng/Computer Engineering, 2003, 29 (17):
  • [5] Speech Endpoint Detection in Noisy Environment Based on the Ensemble Empirical Mode Decomposition
    Li, Jingjiao
    An, Dong
    Wang, Jiao
    Rong, Chaoqun
    MECHATRONICS AND INFORMATION TECHNOLOGY, PTS 1 AND 2, 2012, 2-3 : 135 - 139
  • [6] Speech Endpoint Detection Based on EMD and Higher Order Statistics in Noisy Environments
    Zhang, Dexiang
    Li, Jiaxing
    Chen, Zihong
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING FOR MECHANICS AND MATERIALS, 2015, 21 : 1101 - 1104
  • [7] A formant frequency estimator for noisy speech based on correlation and cepstrum
    Dept. of Electrical and Computer Engineering, Concordia University, 1455 De Maisonneuve Blvd. W., Montreal, QC H3G 1M8, Canada
    Can Acoust, 2008, 3 (160-161):
  • [8] A novel algorithm to robust speech endpoint detection in noisy environments
    Yi, Li
    Yingle, Fan
    ICIEA 2007: 2ND IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-4, PROCEEDINGS, 2007, : 1555 - 1558
  • [9] A robust endpoint detection of speech for noisy environments with application to automatic speech recognition
    Bou-Ghazale, SE
    Assaleh, K
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 3808 - 3811
  • [10] Speech Endpoint Detection in Strong Noisy Environment Based on the Hilbert-Huang Transform
    Lu, Zhimao
    Liu, Baisen
    Shen, Liran
    2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 4322 - +