Speech detection in non-stationary noise based on the 1/f process

Cited by: 2
Authors
Wang, F [1]
Zheng, F [1]
Wu, WH [1]
Affiliation
[1] Tsing Hua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Ctr Speech Technol, Beijing 100084, Peoples R China
Keywords
speech detection; 1/f process; wavelet; robust speech recognition
DOI
10.1007/BF02949828
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes an effective and robust active speech detection method, based on the 1/f process technique, for signals in non-stationary noisy environments. The Gaussian 1/f process, a fractal-based mathematical model for statistically self-similar random processes, is used to model both the speech and the background noise. An optimal Bayesian two-class classifier is developed to discriminate between them using their 1/f wavelet coefficients, which have Karhunen-Loeve-type properties. Multiple templates are trained for the speech signal, and the parameters of the background noise are adapted dynamically at runtime, so that the model tracks variation in both the speech and the noise. In the experiments, a 10-minute speech recording with different types of noise, ranging from 20 dB to 5 dB, is tested with the new detection method. Detection accuracy exceeds 90% when the average SNR is about 10 dB.
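The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a Haar wavelet, a single speech template rather than multiple trained templates, fixed rather than adapted noise parameters, and the common approximation that the wavelet coefficients of a Gaussian 1/f process are uncorrelated, zero-mean, and have a variance that grows geometrically toward coarser scales. All function names and parameter values below are hypothetical.

```python
import math


def haar_details(frame):
    """Haar detail coefficients per scale, finest scale first.

    The frame length is assumed to be a power of two.
    """
    approx = list(frame)
    scales = []
    while len(approx) > 1:
        pairs = list(zip(approx[0::2], approx[1::2]))
        scales.append([(a - b) / math.sqrt(2) for a, b in pairs])
        approx = [(a + b) / math.sqrt(2) for a, b in pairs]
    return scales


def one_over_f_variances(sigma2, gamma, n_scales):
    """Assumed per-scale variance model: sigma2 * 2**(gamma * m).

    Here m = 1 is the finest scale, so gamma > 0 puts more energy
    into coarse (low-frequency) scales and gamma = 0 is white noise.
    """
    return [sigma2 * 2.0 ** (gamma * m) for m in range(1, n_scales + 1)]


def log_likelihood(scales, variances):
    """Log-likelihood of independent zero-mean Gaussian coefficients."""
    total = 0.0
    for coeffs, var in zip(scales, variances):
        for c in coeffs:
            total -= 0.5 * (math.log(2.0 * math.pi * var) + c * c / var)
    return total


def classify(frame, speech_vars, noise_vars, log_prior_ratio=0.0):
    """Two-class Bayesian decision: pick the class with the larger posterior."""
    scales = haar_details(frame)
    llr = log_likelihood(scales, speech_vars) - log_likelihood(scales, noise_vars)
    return "speech" if llr + log_prior_ratio > 0 else "noise"


# Hypothetical parameters: a strongly 1/f-like "speech" model and a white "noise" model.
speech_vars = one_over_f_variances(1.0, 2.0, 3)
noise_vars = one_over_f_variances(1.0, 0.0, 3)

low_freq_frame = [10, 10, 10, 10, -10, -10, -10, -10]
flat_spectrum_frame = [0.5, -0.5, 0.5, -0.5, 0.5, -0.5, 0.5, -0.5]
print(classify(low_freq_frame, speech_vars, noise_vars))
print(classify(flat_spectrum_frame, speech_vars, noise_vars))
```

In a system closer to the one the abstract describes, the single speech variance profile would be replaced by several trained templates, and the noise profile would be re-estimated from frames recently classified as noise.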
Pages: 83 - 89
Page count: 7
Related papers
50 in total
  • [1] Speech detection in non-stationary noise based on the 1/f process
    Fan Wang
    Fang Zheng
    Wenhu Wu
    Journal of Computer Science and Technology, 2002, 17 : 83 - 89
  • [2] Voicing detection based on adaptive aperiodicity thresholding for speech enhancement in non-stationary noise
    Cabanas-Molero, Pablo
    Martinez-Munoz, Damian
    Vera-Candeas, Pedro
    Ruiz-Reyes, Nicolas
    Jose Rodriguez-Serrano, Francisco
    IET SIGNAL PROCESSING, 2014, 8 (02) : 119 - 130
  • [3] Voicing detection based on adaptive aperiodicity thresholding for speech enhancement in non-stationary noise
    1600, Institution of Engineering and Technology, United States (08):
  • [4] Speech enhancement for non-stationary noise environments
    Cohen, I
    Berdugo, B
    SIGNAL PROCESSING, 2001, 81 (11) : 2403 - 2418
  • [5] DETECTION OF A NON-STATIONARY SIGNAL IN NOISE
    MCNEIL, DR
    AUSTRALIAN JOURNAL OF PHYSICS, 1967, 20 (03) : 325 - +
  • [6] MODEL-BASED NOISE PSD ESTIMATION FROM SPEECH IN NON-STATIONARY NOISE
    Nielsen, Jesper Kjaer
    Kavalekalam, Mathew Shaji
    Christensen, Mads Graesboll
    Boldt, Jesper
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018 : 5424 - 5428
  • [7] Voice activity detection in non-stationary noise
    Li Ye
    Wang Tong
    Cui Huijuan
    Tang Kun
    2006 IMACS: MULTICONFERENCE ON COMPUTATIONAL ENGINEERING IN SYSTEMS APPLICATIONS, VOLS 1 AND 2, 2006 : 1573 - +
  • [8] SPARSE HMM-BASED SPEECH ENHANCEMENT METHOD FOR STATIONARY AND NON-STATIONARY NOISE ENVIRONMENTS
    Deng, Feng
    Bao, Chang-chun
    Kleijn, W. Bastiaan
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015 : 5073 - 5077
  • [9] Particle filter based non-stationary noise tracking for robust speech recognition
    Fujimoto, M
    Nakamura, S
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005 : 257 - 260
  • [10] Speech enhancement of non-stationary noise based on Controlled Forward Moving Average
    Farrokhi, Dariush
    Togneri, Roberto
    Zaknich, Anthony
    2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007 : 1551 - 1555