Robust Feature Extraction Using Modulation Filtering of Autoregressive Models

被引:39
|
作者
Ganapathy, Sriram [1 ]
Mallidi, Sri Harish [2 ]
Hermansky, Hynek [2 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
关键词
Autoregressive modeling; feature extraction; modulation filtering; speaker and language recognition; FRONT-END; SPEECH; RECOGNITION;
D O I
10.1109/TASLP.2014.2329190
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speaker and language recognition in noisy and degraded channel conditions continue to be a challenging problem mainly due to the mismatch between clean training and noisy test conditions. In the presence of noise, the most reliable portions of the signal are the high energy regions which can be used for robust feature extraction. In this paper, we propose a front end processing scheme based on autoregressive (AR) models that represent the high energy regions with good accuracy followed by a modulation filtering process. The AR model of the spectrogram is derived using two separable time and frequency AR transforms. The first AR model (temporal AR model) of the sub-band Hilbert envelopes is derived using frequency domain linear prediction (FDLP). This is followed by a spectral AR model applied on the FDLP envelopes. The output 2-D AR model represents a low-pass modulation filtered spectrogram of the speech signal. The band-pass modulation filtered spectrograms can further be derived by dividing two AR models with different model orders (cut-off frequencies). The modulation filtered spectrograms are converted to cepstral coefficients and are used for a speaker recognition task in noisy and reverberant conditions. Various speaker recognition experiments are performed with clean and noisy versions of the NIST-2010 speaker recognition evaluation (SRE) database using the state-of-the-art speaker recognition system. In these experiments, the proposed front-end analysis provides substantial improvements (relative improvements of up to 25%) compared to baseline techniques. Furthermore, we also illustrate the generalizability of the proposed methods using language identification (LID) experiments on highly degraded high-frequency (HF) radio channels and speech recognition experiments on noisy data.
引用
收藏
页码:1285 / 1295
页数:11
相关论文
共 50 条
  • [41] Speech analysis and feature extraction using chaotic models
    Pitsikalis, V
    Maragos, P
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 533 - 536
  • [42] Sequential robust estimation for nonparametric autoregressive models
    Arkoun, Ouerdia
    Pergamenchtchikov, Serguei
    SEQUENTIAL ANALYSIS-DESIGN METHODS AND APPLICATIONS, 2016, 35 (04): : 489 - 515
  • [43] Alternative Robust Estimators for Autoregressive Models with Outliers Using Differential Evolution Algorithm
    Addawe, Rizavel C.
    Addawe, Joel M.
    Magadia, Joselito C.
    PROCEEDING OF THE 4TH INTERNATIONAL CONFERENCE OF FUNDAMENTAL AND APPLIED SCIENCES 2016 (ICFAS2016), 2016, 1787
  • [44] Local feature extraction in fingerprints by complex filtering
    Ronthaler, H
    Kollreider, K
    Bigun, J
    ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3781 : 77 - 84
  • [45] Robust Two-stage Kalman Filtering in Presence of Autoregressive Input
    Zhuang, Huiping
    Li, Junhui
    2016 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2016,
  • [46] Optimal filtering for unsupervised texture feature extraction
    Randen, T
    Alvestad, V
    Husoy, JH
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '96, 1996, 2727 : 441 - 452
  • [47] Robust filtering of SAR image based on multiscale autoregressive graphical model
    School of Computer Science and Technology, Tianjin University of Technology, Tianjin 300191, China
    Guangdianzi Jiguang, 2007, 4 (471-474):
  • [48] Feature characterization in iris recognition with stochastic autoregressive models
    Castanon, Luis E. Garza
    de Oca, Saul Montes
    Morales-Menendez, Ruben
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 168 - 177
  • [49] Feature Extraction using Symbolic Dynamic Filtering for Fault Analysis in Distribution Systems
    Saxena, Kritika
    Gurrala, Gurunath
    2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
  • [50] A robust facial feature point tracker using graphical models
    Cosar, Serhan
    Cetin, Muejdat
    Ercil, Aytuel
    PROCEEDINGS OF THE 5TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2007, : 550 - 555