Speech detection in non-stationary noise based on the 1/f process

被引:2
|
作者
Wang, F [1 ]
Zheng, F [1 ]
Wu, WH [1 ]
机构
[1] Tsing Hua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Ctr Speech Technol, Beijing 100084, Peoples R China
关键词
speech detection; 1/f process; wavelet; robust speech recognition;
D O I
10.1007/BF02949828
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an effective and robust active speech detection method is proposed based on the 1/f process technique for signals under non-stationary noisy environments. The Gaussian 1/f process, a mathematical model for statistically self-similar random processes based on fractals, is selected to model both the speech and the background noise. An optimal Bayesian two-class classifier is developed to discriminate them by their 1/f wavelet coefficients with Karhunen-Loeve-type properties. Multiple templates are trained for the speech signal, and the parameters of the background noise can be dynamically adapted in runtime to model the variation of both the speech and the noise. In our experiments, a 10-minute long speech with different types of noises ranging from 20dB to 5dB is tested using this new detection method. A high performance with over 90% detection accuracy is achieved when average SNR is about 10dB.
引用
收藏
页码:83 / 89
页数:7
相关论文
共 50 条
  • [31] Sparse Hidden Markov Models for Speech Enhancement in Non-Stationary Noise Environments
    Deng, Feng
    Bao, Changchun
    Kleijn, W. Bastiaan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1973 - 1987
  • [32] A more effective speech enhancement algorithm under non-stationary noise environment
    Cheng, Gong
    Guo, Lei
    Zhao, Tianyun
    He, Sheng
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2010, 28 (05): : 664 - 668
  • [33] Robust Speech Enhancement Techniques for ASR in Non-stationary Noise and Dynamic Environments
    Liu, Gang
    Dimitriadis, Dimitrios
    Bocchieri, Enrico
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3016 - 3020
  • [34] Noise estimation for speech enhancement in non-stationary environments-A new method
    Rama Rao, Ch. V.
    Gowthami
    Harsha
    Rajkumar
    Rama Murthy, M.B.
    Srinivasa Rao, K.
    Anitha Sheela, K.
    World Academy of Science, Engineering and Technology, 2010, 46 : 738 - 741
  • [35] Simplified KLT and wavelet transform based approach for enhancing speech degraded by non-stationary wideband noise
    Lou, Hong-Wei
    Hu, Guang-Rui
    Kongzhi yu Juece/Control and Decision, 2003, 18 (05): : 577 - 580
  • [36] An approach based on simplified KLT and wavelet transform for enhancing speech degraded by non-stationary wideband noise
    Lou, HW
    Hu, GR
    JOURNAL OF SOUND AND VIBRATION, 2003, 268 (04) : 717 - 729
  • [37] Non-stationary correlation matrices and noise
    Martins, Andre C. R.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2007, 379 (02) : 552 - 558
  • [38] The phase space of non-stationary noise
    Galleani, L
    Cohen, L
    JOURNAL OF MODERN OPTICS, 2004, 51 (16-18) : 2731 - 2740
  • [39] Stationary and non-stationary noise in superconducting quantum devices
    Martin, I.
    Bulaevskii, L.
    Shnirman, A.
    Galperin, Y. M.
    NOISE AND FLUCTUATIONS IN CIRCUITS, DEVICES, AND MATERIALS, 2007, 6600
  • [40] Modelling a non-stationary BINAR(1) Poisson process
    Khan, Naushad Mamode
    Sunecher, Yuvraj
    Jowaheer, Vandna
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (15) : 3106 - 3126