Speech detection in non-stationary noise based on the 1/f process

被引：2

作者：

Wang, F ^{[1
]}

Zheng, F ^{[1
]}

Wu, WH ^{[1
]}

机构：

[1] Tsing Hua Univ, Dept Comp Sci & Technol, State Key Lab Intelligent Technol & Syst, Ctr Speech Technol, Beijing 100084, Peoples R China

来源：

JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY | 2002年 / 17卷 / 01期

关键词：

speech detection; 1/f process; wavelet; robust speech recognition;

D O I：

10.1007/BF02949828

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, an effective and robust active speech detection method is proposed based on the 1/f process technique for signals under non-stationary noisy environments. The Gaussian 1/f process, a mathematical model for statistically self-similar random processes based on fractals, is selected to model both the speech and the background noise. An optimal Bayesian two-class classifier is developed to discriminate them by their 1/f wavelet coefficients with Karhunen-Loeve-type properties. Multiple templates are trained for the speech signal, and the parameters of the background noise can be dynamically adapted in runtime to model the variation of both the speech and the noise. In our experiments, a 10-minute long speech with different types of noises ranging from 20dB to 5dB is tested using this new detection method. A high performance with over 90% detection accuracy is achieved when average SNR is about 10dB.

引用

页码：83 / 89

页数：7

共 50 条

[21] Markovian Segmentation of Non-stationary Data Corrupted by Non-stationary Noise
Habbouchi, Ahmed
Boudaren, Mohamed El Yazid
Senouci, Mustapha Reda
Aissani, Amar
ADVANCES IN COMPUTING SYSTEMS AND APPLICATIONS, 2022, 513 : 27 - 37
[22] ESTIMATION OF THE MATRIX PARAMETER OF THE AUTOREGRESSIVE PROCESS WITH NON-STATIONARY NOISE
Yurachkivskii, A. P.
Ivanenko, D. O.
THEORY OF PROBABILITY AND MATHEMATICAL STATISTICS, 2005, 72 : 158 - 171
[23] Towards non-stationary model-based noise adaptation for large vocabulary speech recognition
Kristjansson, T
Frey, B
Deng, L
Acero, A
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 337 - 340
[24] An Adaptive Wavelet-Based Denoising Algorithm for Enhancing Speech in Non-stationary Noise Environment
Wang, Kun-Ching
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (02): : 341 - 349
[25] Mask Estimation in Non-stationary Noise Environments for Missing Feature Based Robust Speech Recognition
Badiezadegan, Shirin
Rose, Richard C.
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2062 - 2065
[26] Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments
Heitkaemper, Jens
Schmalenstroeer, Joerg
Haeb-Umbach, Reinhold
INTERSPEECH 2020, 2020, : 2597 - 2601
[27] Noise/spike detection in phonocardiogram signal as a cyclic random process with non-stationary period interval
Naseri, H.
Homaeinezhad, M. R.
Pourkhajeh, H.
COMPUTERS IN BIOLOGY AND MEDICINE, 2013, 43 (09) : 1205 - 1213
[28] Correntropy based IPKF filter for parameter estimation in presence of non-stationary noise process
Sen, Subhamoy
Criniere, Antoine
Mevel, Laurent
Cerou, Frederic
Dumoulin, Jean
IFAC PAPERSONLINE, 2018, 51 (24): : 420 - 427
[29] PITCH ESTIMATION FOR NON-STATIONARY SPEECH
Christensen, Mads Graesboll
Jensen, Jesper Rindom
CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 1400 - 1404
[30] Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments
Malah, D
Cox, RV
Accardi, AJ
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 789 - 792

← 1 2 3 4 5 →