Speech detection in non-stationary noise based on the 1/f process

Cited by: 0

Authors:

Fan Wang

Fang Zheng

Wenhu Wu

Affiliation:

[1] Tsinghua University, Center of Speech Technology, State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology

Source:

Journal of Computer Science and Technology | 2002 / Vol. 17

Keywords:

speech detection; 1/f process; wavelet; robust speech recognition

DOI:

Not available

Abstract:

This paper proposes an effective and robust active speech detection method, based on the 1/f process, for signals in non-stationary noisy environments. The Gaussian 1/f process, a fractal-based mathematical model of statistically self-similar random processes, is used to model both the speech and the background noise. An optimal Bayesian two-class classifier is developed to discriminate between them using their wavelet coefficients, which have Karhunen-Loeve-type properties under the 1/f model. Multiple templates are trained for the speech signal, and the parameters of the background noise are adapted dynamically at runtime, so that the variation of both the speech and the noise is modeled. In the experiments, a 10-minute speech recording with different types of noise ranging from 20 dB to 5 dB is tested with the new detection method. A detection accuracy of over 90% is achieved when the average SNR is about 10 dB.
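The classification idea summarized in the abstract can be illustrated with a small toy sketch. It is not the authors' implementation: the Haar wavelet, the per-scale variance law `sigma2 * 2**(gamma * m)` (a common wavelet-domain approximation for 1/f processes), the function names, and all parameter values are assumptions made for illustration. A random walk stands in for a strongly correlated "speech-like" signal and white noise stands in for the background.

```python
import math
import random


def haar_details(x, levels):
    """Haar detail coefficients per scale, finest scale first."""
    approx = list(x)
    details = []
    for _ in range(levels):
        if len(approx) < 2:
            break
        idx = range(0, len(approx) - 1, 2)
        details.append([(approx[i] - approx[i + 1]) / math.sqrt(2) for i in idx])
        approx = [(approx[i] + approx[i + 1]) / math.sqrt(2) for i in idx]
    return details


def log_likelihood(details, sigma2, gamma):
    """Log-likelihood of the coefficients under an assumed 1/f-type model:
    independent zero-mean Gaussians with variance sigma2 * 2**(gamma * m)
    at scale m, where m = 1 is the finest scale."""
    total = 0.0
    for m, coeffs in enumerate(details, start=1):
        var = sigma2 * 2.0 ** (gamma * m)
        for d in coeffs:
            total += -0.5 * (math.log(2.0 * math.pi * var) + d * d / var)
    return total


def classify_frame(frame, speech_models, noise_model, levels=4, log_prior_ratio=0.0):
    """Two-class Bayesian decision: True if the best speech template
    explains the frame better than the noise model."""
    details = haar_details(frame, levels)
    speech_ll = max(log_likelihood(details, s2, g) for s2, g in speech_models)
    noise_ll = log_likelihood(details, *noise_model)
    return speech_ll - noise_ll + log_prior_ratio > 0


# Toy data (hypothetical): white noise versus a random walk.
rng = random.Random(0)
white = [rng.gauss(0.0, 1.0) for _ in range(256)]
walk = []
level = 0.0
for _ in range(256):
    level += rng.gauss(0.0, 1.0)
    walk.append(level)

speech_models = [(0.125, 2.0)]  # hypothetical (sigma2, gamma) template
noise_model = (1.0, 0.0)        # flat spectrum: gamma = 0

print(classify_frame(walk, speech_models, noise_model))
print(classify_frame(white, speech_models, noise_model))
```

In this sketch several speech templates can be passed in `speech_models`, and the `noise_model` tuple could be re-estimated from frames labeled as noise to mimic runtime adaptation; how the paper performs either step is not specified in the abstract.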

Pages: 83 - 89

Page count: 6

[1] Speech detection in non-stationary noise based on the 1/f process
Wang, F
Zheng, F
Wu, WH
[J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2002, 17 (01) : 83 - 89
[2] Voicing detection based on adaptive aperiodicity thresholding for speech enhancement in non-stationary noise
Cabanas-Molero, Pablo
Martinez-Munoz, Damian
Vera-Candeas, Pedro
Ruiz-Reyes, Nicolas
Jose Rodriguez-Serrano, Francisco
[J]. IET SIGNAL PROCESSING, 2014, 8 (02) : 119 - 130
[3] Speech enhancement for non-stationary noise environments
Cohen, I
Berdugo, B
[J]. SIGNAL PROCESSING, 2001, 81 (11) : 2403 - 2418
[4] DETECTION OF A NON-STATIONARY SIGNAL IN NOISE
MCNEIL, DR
[J]. AUSTRALIAN JOURNAL OF PHYSICS, 1967, 20 (03) : 325 - +
[5] MODEL-BASED NOISE PSD ESTIMATION FROM SPEECH IN NON-STATIONARY NOISE
Nielsen, Jesper Kjaer
Kavalekalam, Mathew Shaji
Christensen, Mads Graesboll
Boldt, Jesper
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5424 - 5428
[6] Voice activity detection in non-stationary noise
Li Ye
Wang Tong
Cui Huijuan
Tang Kun
[J]. 2006 IMACS: MULTICONFERENCE ON COMPUTATIONAL ENGINEERING IN SYSTEMS APPLICATIONS, VOLS 1 AND 2, 2006, : 1573 - +
[7] SPARSE HMM-BASED SPEECH ENHANCEMENT METHOD FOR STATIONARY AND NON-STATIONARY NOISE ENVIRONMENTS
Deng, Feng
Bao, Chang-chun
Kleijn, W. Bastiaan
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5073 - 5077
[8] Particle filter based non-stationary noise tracking for robust speech recognition
Fujimoto, M
Nakamura, S
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 257 - 260
[9] Speech enhancement of non-stationary noise based on Controlled Forward Moving Average
Farrokhi, Dariush
Togneri, Roberto
Zaknich, Anthony
[J]. 2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007, : 1551 - 1555
[10] Enhancement and Noise Statistics Estimation for Non-Stationary Voiced Speech
Norholm, Sidsel Marie
Jensen, Jesper Rindom
Christensen, Mads Graesboll
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (04) : 645 - 658