Endpoint detection of noisy speech based on cepstrum

Authors
Hu, Guangrui [1]
Wei, Xiaodong [1]
Affiliation
[1] Shanghai Jiaotong Univ, Shanghai, China
Keywords
Algorithms; Markov processes; Models; Signal detection; Signal to noise ratio; Spurious signal noise
DOI
Not available
Abstract
A major cause of errors in automatic speech recognition (ASR) systems is inaccurate detection of the beginning and ending boundaries of test and reference patterns. Determining speech endpoints accurately is not very difficult when the signal-to-noise ratio (SNR) is high. Unfortunately, most practical ASR systems must work at low SNR, and conventional speech detection methods based on simple features such as energy do not work well in noisy environments. In this paper, the cepstrum is used as the feature for detecting voice activity, and two algorithms for endpoint detection of noisy speech signals are proposed. The first bases its decision thresholds on cepstral distance instead of short-time energy. The second modifies the HMM-based speech detector to make it adaptive to changes in the noise. Experiments show that high accuracy rates can be obtained.
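The abstract describes the first algorithm only at a high level, so the following is a minimal Python sketch of how a cepstral-distance endpoint detector could look, not the authors' implementation. It assumes the opening frames of the recording contain noise only, uses the common truncated cepstral distance in dB, and sets the threshold at the noise mean plus a multiple of the noise standard deviation. All function names, the frame sizes, the number of coefficients, and the threshold rule are illustrative assumptions.

```python
import numpy as np


def real_cepstrum(frame, n_coeffs=12):
    """Real cepstrum of one frame: inverse FFT of the log magnitude spectrum."""
    windowed = frame * np.hamming(len(frame))
    log_spectrum = np.log(np.abs(np.fft.rfft(windowed)) + 1e-10)  # avoid log(0)
    return np.fft.irfft(log_spectrum)[: n_coeffs + 1]  # keep c0 .. c_n


def cepstral_distance(c, c_ref):
    """Truncated cepstral distance in dB (a common definition, assumed here)."""
    diff = c - c_ref
    return (10.0 / np.log(10.0)) * np.sqrt(diff[0] ** 2 + 2.0 * np.sum(diff[1:] ** 2))


def detect_endpoints(signal, frame_len=256, hop=128, noise_frames=10,
                     k=3.0, min_run=3):
    """Return (start_sample, end_sample) of the detected speech, or None.

    Assumption: the first `noise_frames` frames contain noise only.
    """
    n_frames = 1 + (len(signal) - frame_len) // hop
    cepstra = np.array([
        real_cepstrum(signal[i * hop: i * hop + frame_len])
        for i in range(n_frames)
    ])

    # Reference noise cepstrum: average over the assumed noise-only frames.
    noise_ref = cepstra[:noise_frames].mean(axis=0)
    dist = np.array([cepstral_distance(c, noise_ref) for c in cepstra])

    # Hypothetical threshold rule: noise mean plus k standard deviations.
    threshold = dist[:noise_frames].mean() + k * dist[:noise_frames].std()
    active = dist > threshold

    # Require min_run consecutive active frames to suppress isolated spikes.
    runs = np.convolve(active.astype(int), np.ones(min_run, dtype=int),
                       mode="valid") == min_run
    idx = np.flatnonzero(runs)
    if idx.size == 0:
        return None
    start_frame = idx[0]
    end_frame = idx[-1] + min_run - 1
    return int(start_frame * hop), int(end_frame * hop + frame_len)
```

Calling `detect_endpoints(x)` on a one-dimensional NumPy array of samples returns the sample indices that bracket the detected speech. A short-time-energy detector would have the same structure, with the per-frame energy taking the place of `dist`.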
Pages: 95-97