Endpoint detection of noisy speech based on cepstrum

被引:0
|
作者
Hu, Guangrui [1 ]
Wei, Xiaodong [1 ]
机构
[1] Shanghai Jiaotong Univ, Shanghai, China
来源
关键词
Algorithms - Markov processes - Models - Signal detection - Signal to noise ratio - Spurious signal noise;
D O I
暂无
中图分类号
学科分类号
摘要
A major cause of errors in automatic speech recognition (ASR) systems is the inaccurate detection of the beginning and ending boundaries of test and reference patterns. Accurate determination of endpoints of speech is not very difficult if the SNR is high. Unfortunately, most practical ASR systems must work with a small SNR, and the conventional speech detection methods based on some simple features, such as energy cannot work well in noisy environments. In this paper, cepstrum is used as the feature to detect the voice activity. Two algorithms for endpoint detection of noisy speech signal are proposed. The first one takes the cepstral distance as the decision thresholds instead of short-time energy. The second approach modified the HMM-based speech detector to make it adaptive to the change of noise. The experiments show that high accurate rates can be obtained.
引用
收藏
页码:95 / 97
相关论文
共 50 条
  • [41] Endpoint detection in noisy environment using complexity measure
    Li, Yi
    Fan, Ying-Le
    Tong, Qin-Ye
    2007 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1-4, PROCEEDINGS, 2007, : 1004 - +
  • [42] ENDPOINT DETECTOR OF NOISY SPEECH SIGNAL USING A RECURRENT NEURAL NETWORK
    韦晓东
    胡光锐
    JournalofShanghaiJiaotongUniversity, 1999, (01) : 60 - 63
  • [43] Cepstrum-based deconvolution for speech dereverberation
    Subramaniam, S
    Petropulu, AP
    Wendt, C
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (05): : 392 - 396
  • [44] Improved speech endpoint detection algorithm based on energy-entropy
    Cui Zhi-dong
    Li Ying
    Tao Min
    Proceedings of the 2007 Chinese Control and Decision Conference, 2007, : 331 - 333
  • [45] Towards Speech-Based Collaboration Detection in a Noisy Classroom
    Shahrokhian, Bahar
    VanLehn, Kurt
    ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS AND DOCTORAL CONSORTIUM, PT II, 2022, 13356 : 65 - 70
  • [46] Study on Speech Endpoint Detection Algorithm Based on Wavelet Energy Entropy
    Yali Cao
    Jing Gao
    Guang Yang
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 3965 - 3969
  • [47] Speech Endpoint Detection Based on Improvement Feature and S-Transform
    Lu Xunbo
    Zhu Chunli
    Li Xin
    INTELLIGENT COMPUTING AND INTERNET OF THINGS, PT II, 2018, 924 : 225 - 235
  • [48] Robust endpoint detection for speech recognition based on discriminative feature extraction
    Yamamoto, Koichi
    Jabloun, Firas
    Reinhard, Klaus
    Kawamura, Akinori
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 805 - 808
  • [49] An Improved Speech Endpoint Detection Based on Box-counting Dimension
    Gao Hongbin
    Liu He
    Gao Kai
    Mang Yongqiang
    ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2008, : 337 - 340
  • [50] Speech endpoint detection based on the dynamic segmentation of power spectral envelope
    Xu, Chun-Dong
    Wang, Jing
    Zhang, Ge
    Ying, Dong-Wen
    Li, Jun-Feng
    Yan, Yong-Hong
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2015, 35 (11): : 1189 - 1193