Improved MFCC-Based Feature for Robust Speaker Identification

Cited by: 11
Authors
Wu Zunjing (吴尊敬)
Cao Zhigang (曹志刚)
Affiliation
State Key Laboratory on Microwave and Digital Communications, Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
Funding
National Natural Science Foundation of China
Keywords
Mel-frequency cepstral coefficient (MFCC); robust speaker identification; feature extraction
DOI
Not available
Chinese Library Classification number
TN912.3 [Speech signal processing]
Discipline classification code
0711
Abstract
The Mel-frequency cepstral coefficient (MFCC) is the most widely used feature in speech and speaker recognition. However, MFCC is very sensitive to noise interference, which tends to drastically degrade the performance of recognition systems because of the mismatch between training and testing conditions. In this paper, the logarithmic transformation in the standard MFCC analysis is replaced by a combined function to reduce the sensitivity to noise. The proposed feature extraction process is also combined with speech enhancement methods, such as spectral subtraction and median filtering, to further suppress the noise. Experiments show that the proposed robust MFCC-based feature significantly reduces the recognition error rate over a wide range of signal-to-noise ratios.
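The enhancement steps named in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract gives no parameters, so the noise estimate (mean of the first few frames), the spectral floor, the filter width, and the choice to apply the median filter along the time axis are all assumptions, and every function name here is hypothetical. The abstract also does not define the combined function that replaces the logarithm, so the compression step is left as a pluggable argument that defaults to the standard log.

```python
import numpy as np

def spectral_subtract(power, noise_frames=5, floor=0.01):
    """Basic power spectral subtraction on a (frames, bins) spectrogram.

    Assumption: the first `noise_frames` frames contain noise only.
    """
    noise = power[:noise_frames].mean(axis=0, keepdims=True)
    cleaned = power - noise
    # Keep a small fraction of the original power to avoid negative values.
    return np.maximum(cleaned, floor * power)

def median_filter_time(x, width=3):
    """Sliding median along the time axis (axis 0); `width` must be odd."""
    pad = width // 2
    padded = np.pad(x, ((pad, pad), (0, 0)), mode="edge")
    windows = np.stack([padded[i:i + x.shape[0]] for i in range(width)])
    return np.median(windows, axis=0)

def compress(energies, func=np.log):
    """Compress filter-bank energies; `func` stands in for the paper's
    combined function, which the abstract does not specify."""
    return func(np.maximum(energies, 1e-10))

# Toy input: flat noise floor with a one-frame spike.
power = np.ones((10, 4))
power[7] = 5.0
enhanced = median_filter_time(spectral_subtract(power))
features = compress(enhanced)
```

In a full MFCC front end, the enhanced spectrogram would pass through a Mel filter bank before compression and a discrete cosine transform afterwards; those stages are omitted here to keep the sketch short.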
Pages: 158-161
Page count: 4