Speech detection in non-stationary noise based on the 1/f process

Cited by: 2
Authors
Wang, F [1]
Zheng, F [1]
Wu, WH [1]
Affiliation
[1] Tsing Hua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Ctr Speech Technol, Beijing 100084, Peoples R China
Keywords
speech detection; 1/f process; wavelet; robust speech recognition
DOI
10.1007/BF02949828
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes an effective and robust active speech detection method, based on the 1/f process, for signals in non-stationary noise environments. The Gaussian 1/f process, a fractal-based mathematical model for statistically self-similar random processes, is selected to model both the speech and the background noise. An optimal Bayesian two-class classifier is developed to discriminate between them using their 1/f wavelet coefficients, which have Karhunen-Loeve-type properties. Multiple templates are trained for the speech signal, and the parameters of the background noise are adapted dynamically at runtime, so that the models track the variation of both the speech and the noise. In our experiments, the method is tested on a 10-minute speech recording with different types of noise, ranging from 20 dB to 5 dB. Detection accuracy exceeds 90% when the average SNR is about 10 dB.
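The classification idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a Haar wavelet, assumes that the wavelet coefficients of a Gaussian 1/f process are independent, zero-mean, and have a variance that scales geometrically with the decomposition level, and picks the best-matching speech template by maximum likelihood. All function names and parameter values (`haar_details`, `log_likelihood`, `classify`, the `(sigma2, gamma)` pairs) are hypothetical.

```python
import math
import random


def haar_details(x, levels):
    """Return Haar detail coefficients per level (level 1 = finest scale)."""
    approx = list(x)
    details = []
    for _ in range(levels):
        idx = range(0, len(approx) - 1, 2)
        d = [(approx[i] - approx[i + 1]) / math.sqrt(2) for i in idx]
        a = [(approx[i] + approx[i + 1]) / math.sqrt(2) for i in idx]
        details.append(d)
        approx = a
    return details


def log_likelihood(details, sigma2, gamma):
    """Gaussian log-likelihood of the coefficients under an assumed 1/f model.

    Assumption: coefficients are independent and zero-mean, with variance
    sigma2 * 2**(gamma * m) at level m (m = 1 is the finest scale).
    """
    total = 0.0
    for m, coeffs in enumerate(details, start=1):
        var = sigma2 * 2.0 ** (gamma * m)
        for c in coeffs:
            total -= 0.5 * (math.log(2.0 * math.pi * var) + c * c / var)
    return total


def classify(frame, speech_templates, noise_params, levels=4, log_prior_ratio=0.0):
    """Two-class decision from a log-likelihood ratio plus a log prior ratio.

    speech_templates is a list of hypothetical (sigma2, gamma) pairs; the
    best-scoring template stands in for the speech class.
    """
    details = haar_details(frame, levels)
    speech_ll = max(log_likelihood(details, s2, g) for s2, g in speech_templates)
    noise_ll = log_likelihood(details, *noise_params)
    return "speech" if speech_ll - noise_ll + log_prior_ratio > 0 else "noise"


if __name__ == "__main__":
    rng = random.Random(0)
    white = [rng.gauss(0.0, 1.0) for _ in range(512)]  # flat spectrum
    walk, level = [], 0.0
    for _ in range(512):  # random walk: a strongly 1/f-like toy signal
        level += rng.gauss(0.0, 1.0)
        walk.append(level)
    templates = [(0.1, 2.0), (0.5, 1.5)]  # illustrative values only
    noise = (1.0, 0.0)  # white noise: same variance at every level
    print(classify(white, templates, noise))
    print(classify(walk, templates, noise))
```

In this sketch the noise model is fixed; the runtime adaptation mentioned in the abstract would correspond to re-estimating `noise_params` from frames recently classified as noise, which is not shown here.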
Pages: 83-89
Page count: 7