Robust Speech Analysis Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition in Noisy Environments

Authors
Boonkla, Surasak [1, 2]
Unoki, Masashi [1 ]
Makhanov, Stanislav S. [2 ]
Affiliations
[1] Japan Adv Inst Sci & Technol, Sch Informat Sci, Nomi, Japan
[2] Thammasat Univ, Sirindhorn Int Inst Technol, Pathum Thani, Thailand
Source
SPEECH AND COMPUTER | 2016 / Vol. 9811
Keywords
Multivariate empirical mode decomposition; Speech analysis; Fundamental frequency; Formant frequency; Source-filter model;
DOI
10.1007/978-3-319-43958-7_70
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper proposes a robust speech analysis method for noisy conditions, based on the source-filter model and using multivariate empirical mode decomposition (MEMD). The proposed method has two stages. In the first stage, the magnitude spectrum of the noisy speech signal is decomposed by MEMD into intrinsic mode functions (IMFs), and the IMFs corresponding to the noise are removed. In the second stage, the log-magnitude spectrum of the noise-reduced signal is decomposed into IMFs, which are then divided into two groups: the first, characterized by spectral fine structure, is used for fundamental frequency estimation, and the second, characterized by the frequency response of the vocal-tract filter, is used for formant frequency estimation. In contrast to the conventional linear prediction (LP) and cepstrum methods, the proposed method decomposes the noise automatically in the magnitude spectral domain and makes the noise mixture sparse in the log-magnitude spectral domain. The results show that the proposed method outperforms the LP and cepstrum methods under noisy conditions.
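For orientation, the source-filter split of a log-magnitude spectrum into spectral fine structure (fundamental frequency) and a smooth envelope (vocal-tract response) can be sketched with the conventional cepstrum method that the abstract names as a baseline. This is not the MEMD-based method proposed in the paper. The function names, the 2 ms lifter cutoff, the F0 search range, and the synthetic test signal are all assumptions made for illustration.

```python
import numpy as np

def cepstral_split(frame, fs, lifter_ms=2.0):
    """Split a frame's log-magnitude spectrum into envelope and fine structure.

    Baseline cepstrum approach (not MEMD). lifter_ms is an assumed cutoff.
    """
    n = len(frame)
    windowed = frame * np.hanning(n)
    log_mag = np.log(np.abs(np.fft.fft(windowed)) + 1e-12)
    cepstrum = np.fft.ifft(log_mag).real      # real cepstrum
    cutoff = int(lifter_ms * 1e-3 * fs)       # low-quefrency boundary in samples
    lifter = np.zeros(n)
    lifter[:cutoff] = 1.0                     # keep low quefrencies ...
    lifter[n - cutoff + 1:] = 1.0             # ... and their mirrored copies
    envelope = np.fft.fft(cepstrum * lifter).real  # smooth vocal-tract part
    fine = log_mag - envelope                      # harmonic fine structure
    return envelope, fine, cepstrum

def estimate_f0(cepstrum, fs, fmin=60.0, fmax=400.0):
    """Pick the cepstral peak inside an assumed F0 search range."""
    qmin = int(fs / fmax)
    qmax = int(fs / fmin)
    peak = qmin + int(np.argmax(cepstrum[qmin:qmax]))
    return fs / peak

# Synthetic voiced sound: impulse train (source) through one resonance (filter).
fs = 16000
period = 100                                  # samples, i.e. 160 Hz
excitation = np.zeros(4096)
excitation[::period] = 1.0
t = np.arange(512)
r = np.exp(-np.pi * 150.0 / fs)               # assumed 150 Hz bandwidth
theta = 2.0 * np.pi * 700.0 / fs              # assumed 700 Hz resonance
impulse_response = (r ** t) * np.sin(theta * t)
signal = np.convolve(excitation, impulse_response)[:len(excitation)]

frame = signal[2048:3072]                     # steady-state 64 ms frame
envelope, fine, cepstrum = cepstral_split(frame, fs)
f0 = estimate_f0(cepstrum, fs)
print(f"estimated F0: {f0:.1f} Hz")
```

The test signal is noise-free, so the sketch only illustrates the split itself. The abstract's claim concerns noisy conditions, where this baseline is reported to perform worse than the proposed method.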
Pages: 580-587
Number of pages: 8
Related papers
50 in total
  • [1] Speech Analysis Method Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition
    Boonkla, Surasak
    Unoki, Masashi
    Makhanov, Stanislav S.
    Wutiwiwatchai, Chai
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (10) : 1762 - 1773
  • [2] Speech Analysis Method Based on Source-Filter Model Using Multivariate Empirical Mode Decomposition in Log-Spectrum Domain
    Boonkla, Surasak
    Unoki, Masashi
    Makhanov, Stanislav S.
    Wutiwiwatchai, Chai
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 555 - +
  • [3] Speech Stream Detection for Noisy Environments Based on Empirical Mode Decomposition
    Tang Qiang
    Zhang Dexiang
    Yan Qing
    ADVANCED DESIGN AND MANUFACTURING TECHNOLOGY III, PTS 1-4, 2013, 397-400 : 2239 - +
  • [4] Robust glottal source estimation based on joint source-filter model optimization
    Fu, Qiang
    Murphy, Peter
    Yan, Yong-Hong
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2007, 35 (05): : 982 - 986
  • [5] Robust glottal source estimation based on joint source-filter model optimization
    Fu, Q
    Murphy, P
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02): : 492 - 501
  • [6] Pitch Estimation of Noisy Speech Signals using Empirical Mode Decomposition
    Molla, Md. Khademul Islam
    Hirose, Keikichi
    Minematsu, Nobuaki
    Hasan, Md. Kamrul
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2177 - +
  • [7] Empirical Mode Decomposition Based Reconstruction of Speech Signal in Noisy Environment
    Goswami, Nisha
    Sarma, Mousmita
    Sarma, Kandarpa Kumar
    2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2014, : 760 - 765
  • [8] Speech Endpoint Detection in Noisy Environment Based on the Ensemble Empirical Mode Decomposition
    Li, Jingjiao
    An, Dong
    Wang, Jiao
    Rong, Chaoqun
    MECHATRONICS AND INFORMATION TECHNOLOGY, PTS 1 AND 2, 2012, 2-3 : 135 - 139
  • [9] A Source-Filter based Adaptive Harmonic Model and Its Application to Speech Prosody Modification
    Lee, JeeSok
    Soong, Frank K.
    Kang, Hong-Goo
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 39 - 43
  • [10] Reconstruction Of Speech Signal Using Empirical Mode Decomposition Based Glottal Source Extraction
    Goswami, Nisha
    Sarma, Mousmita
    Sarma, Kandarpa Kumar
    2013 1ST INTERNATIONAL CONFERENCE ON EMERGING TRENDS AND APPLICATIONS IN COMPUTER SCIENCE (ICETACS), 2013, : 27 - 32