Noise-robust speech feature processing with empirical mode decomposition

Cited by: 3
Authors
Wu, Kuo-Hau [1]
Chen, Chia-Ping [1]
Yeh, Bing-Feng [1]
Affiliations
[1] Natl Sun Yat Sen Univ, Dept Comp Sci & Engn, Kaohsiung 800, Taiwan
Keywords
Speech Signal; Empirical Mode Decomposition; Automatic Speech Recognition; Intrinsic Mode Function; Lower Envelope
DOI
10.1186/1687-4722-2011-9
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
In this article, a novel technique based on the empirical mode decomposition methodology for processing speech features is proposed and investigated. Empirical mode decomposition generalizes Fourier analysis: it decomposes a signal into a sum of intrinsic mode functions. In this study, we implement an iterative algorithm to find the intrinsic mode functions of any given signal. We design a novel speech feature post-processing method based on the extracted intrinsic mode functions to achieve noise robustness for automatic speech recognition. Evaluation results on the noisy-digit Aurora 2.0 database show that our method leads to significant performance improvement. The relative improvement over the baseline features increases from 24.0% to 41.1% when the proposed post-processing method is applied to mean-variance normalized speech features. The proposed method also improves on the performance achieved by a very noise-robust frontend when the test speech data are highly mismatched.
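The iterative search for intrinsic mode functions mentioned in the abstract can be illustrated with a generic sifting loop. The following is a minimal sketch of textbook-style empirical mode decomposition, not the authors' implementation: the function names (`local_extrema`, `envelope`, `sift`, `emd`), the fixed number of sifting passes, and the piecewise-linear envelopes (cubic splines are the more common choice) are all assumptions made to keep the example short and dependency-free. The speech feature post-processing step is not sketched, because this record does not describe it.

```python
import math

def local_extrema(x):
    """Return indices of interior local maxima and minima of sequence x."""
    maxima, minima = [], []
    for i in range(1, len(x) - 1):
        if x[i - 1] < x[i] >= x[i + 1]:
            maxima.append(i)
        elif x[i - 1] > x[i] <= x[i + 1]:
            minima.append(i)
    return maxima, minima

def envelope(x, idx):
    """Piecewise-linear envelope through x at idx, pinned to both endpoints.

    Assumption: linear interpolation stands in for the cubic splines
    that EMD implementations commonly use.
    """
    n = len(x)
    knots = sorted(set([0] + idx + [n - 1]))
    env = [0.0] * n
    for a, b in zip(knots, knots[1:]):
        for i in range(a, b + 1):
            t = (i - a) / (b - a)
            env[i] = (1 - t) * x[a] + t * x[b]
    return env

def sift(x, n_sift=10):
    """Extract one IMF candidate by repeatedly subtracting the envelope mean.

    Assumption: a fixed pass count replaces a convergence-based stop rule.
    """
    h = list(x)
    for _ in range(n_sift):
        maxima, minima = local_extrema(h)
        if not maxima or not minima:
            break
        upper = envelope(h, maxima)
        lower = envelope(h, minima)
        h = [hi - 0.5 * (u + l) for hi, u, l in zip(h, upper, lower)]
    return h

def emd(x, max_imfs=5, n_sift=10):
    """Decompose x into a list of IMF candidates plus a final residue."""
    residue = list(x)
    imfs = []
    for _ in range(max_imfs):
        maxima, minima = local_extrema(residue)
        if len(maxima) + len(minima) < 2:
            break  # too few extrema left to build envelopes
        imf = sift(residue, n_sift)
        imfs.append(imf)
        residue = [r - c for r, c in zip(residue, imf)]
    return imfs, residue

# Usage: a slow sinusoid plus a faster, weaker one (synthetic test signal).
signal = [math.sin(2 * math.pi * 5 * t / 200)
          + 0.5 * math.sin(2 * math.pi * 40 * t / 200) for t in range(200)]
imfs, residue = emd(signal)
# The components sum back to the input, up to floating-point rounding.
reconstructed = [sum(c[i] for c in imfs) + residue[i]
                 for i in range(len(signal))]
```

Because each component is subtracted from the running residue, the intrinsic mode functions and the final residue always add back up to the original signal, which is the "sum of intrinsic mode functions" property stated in the abstract.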
Pages: 1-9
Page count: 9
Related papers
  • [1] Noise-robust speech feature processing with empirical mode decomposition
    Kuo-Hau Wu
    Chia-Ping Chen
    Bing-Feng Yeh
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2011