Noise-robust speech feature processing with empirical mode decomposition

Cited by: 0
Authors
Kuo-Hau Wu
Chia-Ping Chen
Bing-Feng Yeh
Affiliations
[1] National Sun Yat-Sen University, Department of Computer Science and Engineering
Keywords
Speech Signal; Empirical Mode Decomposition; Automatic Speech Recognition; Intrinsic Mode Function; Lower Envelope;
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
In this article, a novel technique for processing speech features, based on the empirical mode decomposition methodology, is proposed and investigated. Empirical mode decomposition generalizes Fourier analysis: it decomposes a signal into a sum of intrinsic mode functions. In this study, we implement an iterative algorithm to find the intrinsic mode functions of any given signal. We design a novel speech feature post-processing method based on the extracted intrinsic mode functions to achieve noise-robustness for automatic speech recognition. Evaluation results on the noisy-digit Aurora 2.0 database show that our method leads to significant performance improvement. The relative improvement over the baseline features increases from 24.0% to 41.1% when the proposed post-processing method is applied to mean-variance normalized speech features. The proposed method also improves on the performance achieved by a very noise-robust frontend when the test speech data are highly mismatched.
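The iterative search for intrinsic mode functions mentioned in the abstract can be illustrated with a generic sifting loop. The following is a minimal sketch and not the authors' implementation: the function names (`local_extrema`, `envelope`, `sift_imf`, `emd`), the fixed number of sifting passes, and the piecewise-linear envelopes are all assumptions made for brevity (standard empirical mode decomposition typically uses cubic-spline envelopes and a convergence criterion).

```python
import math


def local_extrema(x):
    """Return index lists of strict local maxima and minima of x."""
    maxima, minima = [], []
    for i in range(1, len(x) - 1):
        if x[i - 1] < x[i] > x[i + 1]:
            maxima.append(i)
        elif x[i - 1] > x[i] < x[i + 1]:
            minima.append(i)
    return maxima, minima


def envelope(x, knots):
    """Piecewise-linear envelope through x at the knot indices.

    Assumption: linear interpolation, held constant outside the
    first and last knot (a simplification of spline envelopes).
    """
    n = len(x)
    env = [0.0] * n
    for i in range(knots[0]):
        env[i] = x[knots[0]]
    for a, b in zip(knots, knots[1:]):
        for i in range(a, b):
            t = (i - a) / (b - a)
            env[i] = (1.0 - t) * x[a] + t * x[b]
    for i in range(knots[-1], n):
        env[i] = x[knots[-1]]
    return env


def sift_imf(x, n_sift=10):
    """Extract one candidate intrinsic mode function by repeatedly
    subtracting the mean of the upper and lower envelopes."""
    h = list(x)
    for _ in range(n_sift):
        maxima, minima = local_extrema(h)
        if not maxima or not minima:
            break
        upper = envelope(h, maxima)
        lower = envelope(h, minima)
        h = [v - 0.5 * (u + l) for v, u, l in zip(h, upper, lower)]
    return h


def emd(x, max_imfs=4, n_sift=10):
    """Decompose x into a list of IMFs plus a residual, so that
    x equals the sum of the IMFs and the residual."""
    residual = list(x)
    imfs = []
    for _ in range(max_imfs):
        maxima, minima = local_extrema(residual)
        if not maxima or not minima:
            break  # residual has no oscillation left to extract
        imf = sift_imf(residual, n_sift)
        imfs.append(imf)
        residual = [r - c for r, c in zip(residual, imf)]
    return imfs, residual


# Illustrative input: a slow oscillation plus a faster, weaker one.
signal = [math.sin(2 * math.pi * 3 * k / 200)
          + 0.5 * math.sin(2 * math.pi * 25 * k / 200)
          for k in range(200)]
imfs, residual = emd(signal)
```

By construction, adding the extracted functions and the residual recovers the input, which mirrors the "sum of intrinsic mode functions" description above. How the extracted functions are then used to post-process speech features is specific to the paper and is not reproduced here.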
Related papers
  • [1] Noise-robust speech feature processing with empirical mode decomposition
    Wu, Kuo-Hau
    Chen, Chia-Ping
    Yeh, Bing-Feng
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 9
  • [2] Empirical Mode Decomposition For Noise-Robust Automatic Speech Recognition
    Wu, Kuo-Hao
    Chen, Chia-Ping
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2074 - 2077
  • [3] Noise-Robust Speech Signals Processing for the Voice Control System Based on the Complementary Ensemble Empirical Mode Decomposition
    Kazanferovich, Alimuradov Alan
    Pavlovich, Churakov Pyotr
    2015 INTERNATIONAL SIBERIAN CONFERENCE ON CONTROL AND COMMUNICATIONS (SIBCON), 2015,
  • [4] Nonlinear mode decomposition: A noise-robust, adaptive decomposition method
    Iatsenko, Dmytro
    McClintock, Peter V. E.
    Stefanovska, Aneta
    PHYSICAL REVIEW E, 2015, 92 (03):
  • [5] Factorial Speech Processing Models for Noise-Robust Automatic Speech Recognition
    Khademian, Mahdi
    Homayounpour, Mohammad Mehdi
    2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 637 - 642
  • [6] On the temporal decorrelation of feature parameters for noise-robust speech recognition
    Jung, HY
    Lee, SY
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (04): : 407 - 416
  • [7] Noise-robust adaptive feature mode decomposition method for accurate feature extraction in rotating machinery fault diagnosis
    Chen, Yuyang
    Mao, Zhiwei
    Hou, Xiuqun
    Zhang, Zhaoguang
    Zhang, Jinjie
    Jiang, Zhinong
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2024, 211
  • [8] Noise-robust speech triage
    Bartos, Anthony L.
    Cipr, Tomas
    Nelson, Douglas J.
    Schwarz, Petr
    Banowetz, John
    Jerabek, Ladislav
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (04): : 2313 - 2320
  • [9] New continuous speech feature adjustment for a noise-robust CSR system
    Sun, Yiming
    Miyanaga, Yoshikazu
    11th International Symposium on Communications and Information Technologies, ISCIT 2011, 2011, : 309 - 313
  • [10] INTERACTIVE FEATURE FUSION FOR END-TO-END NOISE-ROBUST SPEECH RECOGNITION
    Hu, Yuchen
    Hou, Nana
    Chen, Chen
    Chng, Eng Siong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6292 - 6296