Non-Linear Filtering for Feature Enhancement of Reverberant Speech

Cited by: 0
Authors
Verma, Amit Kumar [1]
Tomar, Hemendra [2]
Chetupalli, Srikanth Raj [3]
Sreenivas, T. V. [3]
Affiliations
[1] Elect & Radar Dev Estab LRDE, Bangalore, Karnataka, India
[2] Indian Air Force, New Delhi, India
[3] Indian Inst Sci, Dept Elect Commun Engn, Bangalore, Karnataka, India
Keywords
Gaussian mixture model (GMM); universal background model (UBM); mel frequency cepstral coefficients (MFCC); non-linear filtering; identification; models
DOI
Not available
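Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronics and communication technology]
Subject classification codes
0808; 0809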
Abstract
Speaker identification on a mobile robot is challenging because the robot encounters varying reverberant environments while in motion. The performance of a typical speaker identification system degrades significantly in reverberant environments, mainly because conventional features are not robust to changes in reverberant conditions. In this paper, we present a non-linear filter based mel frequency cepstral coefficient (MFCC) feature extraction method that is more robust to such changes. The method is a two-stage operation applied to the spectrogram of the speech signal: the first stage suppresses the frequency spread due to reverberation within each frame, and the second stage suppresses the reverberation effect across frames. Performance is evaluated with a GMM-UBM based identifier built and tested with conventional MFCC feature vectors and with the non-linear filter based MFCC feature vectors. We show that the identification accuracy of the GMM-UBM based identifier is higher with the non-linear filter based MFCC feature vectors than with conventional MFCC feature vectors.
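The two-stage structure described in the abstract can be illustrated with a minimal sketch. The abstract does not name the non-linear filter, so a sliding median is used here purely as an assumed stand-in; the function names, window widths, and the (frequency bins x frames) array layout are likewise hypothetical and not taken from the paper.

```python
import numpy as np


def median_filter_1d(x, width, axis):
    """Sliding median of odd `width` along `axis`, with edge replication."""
    half = width // 2
    pad = [(0, 0)] * x.ndim
    pad[axis] = (half, half)
    padded = np.pad(x, pad, mode="edge")
    # Windows are appended as a new last dimension of length `width`.
    windows = np.lib.stride_tricks.sliding_window_view(padded, width, axis=axis)
    return np.median(windows, axis=-1)


def two_stage_filter(spectrogram, freq_width=3, time_width=5):
    """Apply an assumed median filter in two stages.

    spectrogram: array of shape (n_freq_bins, n_frames).
    Stage 1 filters along frequency within each frame.
    Stage 2 filters along time across frames.
    """
    stage1 = median_filter_1d(spectrogram, freq_width, axis=0)
    return median_filter_1d(stage1, time_width, axis=1)


# Toy input: a flat spectrogram with one isolated spike.
spec = np.ones((8, 20))
spec[4, 10] = 100.0
filtered = two_stage_filter(spec)
```

In a full pipeline, the filtered spectrogram would then pass through the usual mel filterbank, logarithm, and DCT steps to obtain MFCC features; those steps are omitted here.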
Pages: 1800-1805
Page count: 6