Non-linear feature extraction for robust speech recognition in stationary and non-stationary noise

被引:13
|
作者
Zhu, QF [1 ]
Alwan, A [1 ]
机构
[1] Univ Calif Los Angeles, Henry Samuli Sch Engn & Appl Sci, Dept Elect Engn, Los Angeles, CA 90095 USA
来源
COMPUTER SPEECH AND LANGUAGE | 2003年 / 17卷 / 04期
关键词
D O I
10.1016/S0885-2308(03)00026-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An analysis-based non-linear feature extraction approach is proposed, inspired by a model of how speech amplitude spectra are affected by additive noise. Acoustic features are extracted based on the noise-robust parts of speech spectra without losing discriminative information. Two non-linear processing methods, harmonic demodulation and spectral peak-to-valley ratio locking., are designed to minimize mismatch between clean and noisy speech features. A previously studied method, peak isolation [IEEE Transactions on Speech and Audio Processing 5 (1997) 451]. is also discussed with this model. These methods do not require noise estimation and are effective in dealing with both stationary and non-stationary noise. In the presence of additive noise, ASR experiments show that using these techniques in the computation of MFCCs improves recognition performance greatly. For the T146 isolated digits database. the average recognition rate across several SNRs is improved from 60% (using unmodified MFCCs) to 95% (using the proposed techniques) with additive speech-shaped noise. For the Aurora 2 connected digit-string database, the average recognition rate across different noise types, including non-stationary noise background, and SNRs improves from 58% to 80%. (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:381 / 402
页数:22
相关论文
共 50 条
  • [21] A novel denoising method for non-linear and non-stationary signals
    Wu, Honglin
    Wang, Zhongbin
    Si, Lei
    Tan, Chao
    Zou, Xiaoyu
    Liu, Xinhua
    Li, Futao
    [J]. IET SIGNAL PROCESSING, 2023, 17 (01)
  • [22] EEG non-stationary spectrum analysis and feature extraction
    Martins, N
    Rosa, AC
    [J]. INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, : 881 - 886
  • [23] NON-STATIONARY NOISE ESTIMATION METHOD BASED ON BIAS-RESIDUAL COMPONENT DECOMPOSITION FOR ROBUST SPEECH RECOGNITION
    Fujimoto, Masakiyo
    Watanabe, Shinji
    Nakatani, Tomohiro
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4816 - 4819
  • [24] An Efficient Noise Elimination Method for Non-stationary and Non-linear Signals by Averaging Decomposed Components
    Sun, Zhenzhou
    Lu, Hongchao
    Chen, Jiefeng
    Jiao, Jialong
    [J]. SHOCK AND VIBRATION, 2022, 2022
  • [25] Non-stationary environment compensation using sequential EM algorithm for robust speech recognition
    Shen, HF
    Guo, J
    Liu, G
    Li, QX
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 264 - 273
  • [26] AN ANALYSIS OF VECTOR TAYLOR SERIES MODEL COMPENSATION FOR NON-STATIONARY NOISE IN SPEECH RECOGNITION
    Duc Hoang Ha Nguyen
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    [J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 131 - 135
  • [27] Enhancement and Noise Statistics Estimation for Non-Stationary Voiced Speech
    Norholm, Sidsel Marie
    Jensen, Jesper Rindom
    Christensen, Mads Grsboll
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (04) : 645 - 658
  • [28] Analysis of non-linear and non-stationary seismic recordings of Mexico city
    Garcia, S. R.
    Romo, M. P.
    Alcantara, L.
    [J]. SOIL DYNAMICS AND EARTHQUAKE ENGINEERING, 2019, 127
  • [29] Speech Enhancement in Non-Stationary Noise Using Compressive Sensing
    Sulong, Amart
    Gunawan, Teddy Surya
    Khalifa, Othman O.
    Kartiwi, Mira
    [J]. PROCEEDINGS OF 6TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE 2016), 2016, : 489 - 493
  • [30] A Computational Intelligence Technique for the Identification of Non-Linear Non-Stationary Systems
    Turchetti, Claudio
    Gianfelici, Francesco
    Biagetti, Giorgio
    Crippa, Paolo
    [J]. 2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 3034 - 3038