Robust Feature Extraction Using Modulation Filtering of Autoregressive Models

被引:39
|
作者
Ganapathy, Sriram [1 ]
Mallidi, Sri Harish [2 ]
Hermansky, Hynek [2 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
关键词
Autoregressive modeling; feature extraction; modulation filtering; speaker and language recognition; FRONT-END; SPEECH; RECOGNITION;
D O I
10.1109/TASLP.2014.2329190
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speaker and language recognition in noisy and degraded channel conditions continue to be a challenging problem mainly due to the mismatch between clean training and noisy test conditions. In the presence of noise, the most reliable portions of the signal are the high energy regions which can be used for robust feature extraction. In this paper, we propose a front end processing scheme based on autoregressive (AR) models that represent the high energy regions with good accuracy followed by a modulation filtering process. The AR model of the spectrogram is derived using two separable time and frequency AR transforms. The first AR model (temporal AR model) of the sub-band Hilbert envelopes is derived using frequency domain linear prediction (FDLP). This is followed by a spectral AR model applied on the FDLP envelopes. The output 2-D AR model represents a low-pass modulation filtered spectrogram of the speech signal. The band-pass modulation filtered spectrograms can further be derived by dividing two AR models with different model orders (cut-off frequencies). The modulation filtered spectrograms are converted to cepstral coefficients and are used for a speaker recognition task in noisy and reverberant conditions. Various speaker recognition experiments are performed with clean and noisy versions of the NIST-2010 speaker recognition evaluation (SRE) database using the state-of-the-art speaker recognition system. In these experiments, the proposed front-end analysis provides substantial improvements (relative improvements of up to 25%) compared to baseline techniques. Furthermore, we also illustrate the generalizability of the proposed methods using language identification (LID) experiments on highly degraded high-frequency (HF) radio channels and speech recognition experiments on noisy data.
引用
收藏
页码:1285 / 1295
页数:11
相关论文
共 50 条
  • [31] Part identification using robust feature extraction and pattern classification
    Hunt, MA
    Hicks, JS
    Gleason, SS
    MACHINE VISION APPLICATIONS IN INDUSTRIAL INSPECTION IV, 1996, 2665 : 219 - 230
  • [32] Sparse Robust Dynamic Feature Extraction using Bayesian Inference
    Puli, Vamsi Krishna
    Chiplunkar, Ranjith
    Huang, Biao
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (06) : 6201 - 6209
  • [33] A robust texture feature extraction using the localized angular phase
    Saipullah, Khairul Muzzammil
    Kim, Deok-Hwan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 59 (03) : 717 - 747
  • [34] Robust estimation of sparse vector autoregressive models
    Kim, Dongyeong
    Baek, Changryong
    KOREAN JOURNAL OF APPLIED STATISTICS, 2022, 35 (05) : 631 - 644
  • [35] Robust Estimation Procedure for Autoregressive Models with Heterogeneity
    A. Callens
    Y.-G. Wang
    L. Fu
    B. Liquet
    Environmental Modeling & Assessment, 2021, 26 : 313 - 323
  • [36] Automatic Digital Modulation Recognition Using Minimum Feature Extraction
    Kumar, Punith H. L.
    Shrinivasan, Lakshmi
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 772 - 775
  • [37] Automatic Digital Modulation Recognition System Using Feature Extraction
    Kumar, H. L. Punith
    Shrinivasan, Lakshmi
    EMERGING TRENDS IN ELECTRICAL, COMMUNICATIONS AND INFORMATION TECHNOLOGIES, 2017, 394 : 201 - 208
  • [38] Robust Estimation Procedure for Autoregressive Models with Heterogeneity
    Callens, A.
    Wang, Y. -G.
    Fu, L.
    Liquet, B.
    ENVIRONMENTAL MODELING & ASSESSMENT, 2021, 26 (03) : 313 - 323
  • [39] ROBUST IDENTIFICATION OF AUTOREGRESSIVE MOVING AVERAGE MODELS
    MASAROTTO, G
    APPLIED STATISTICS-JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C, 1987, 36 (02): : 214 - 220
  • [40] Novel filtering and regeneration technique with statistical feature extraction and machine learning for automatic modulation classification
    Sarmanbetov, Sanzhar
    Nurgaliyev, Madiyar
    Zholamanov, Batyrbek
    Kopbay, Kymbat
    Saymbetov, Ahmet
    Bolatbek, Askhat
    Kuttybay, Nurzhigit
    Orynbassar, Sayat
    Yershov, Evan
    DIGITAL SIGNAL PROCESSING, 2024, 155