Improved MFCC-Based Feature for Robust Speaker Identification

被引:11
|
作者
吴尊敬
曹志刚
机构
[1] China
[2] Department of Electronic Engineering
[3] Tsinghua University
[4] State Key Laboratory on Microwave and Digital Communications
[5] Beijing 100084
基金
中国国家自然科学基金;
关键词
Mel-frequency cepstral coefficient (MFCC); robust speaker identification; feature extraction;
D O I
暂无
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
The Mel-frequency cepstral coefficient (MFCC) is the most widely used feature in speech and speaker recognition. However, MFCC is very sensitive to noise interference, which tends to drastically de- grade the performance of recognition systems because of the mismatches between training and testing. In this paper, the logarithmic transformation in the standard MFCC analysis is replaced by a combined function to improve the noisy sensitivity. The proposed feature extraction process is also combined with speech en- hancement methods, such as spectral subtraction and median-filter to further suppress the noise. Experi- ments show that the proposed robust MFCC-based feature significantly reduces the recognition error rate over a wide signal-to-noise ratio range.
引用
收藏
页码:158 / 161
页数:4
相关论文
共 50 条
  • [21] ROBUST SPEAKER IDENTIFICATION USING AN AUDITORY-BASED FEATURE
    Li, Qi
    Huang, Yan
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4514 - 4517
  • [22] Robust speaker identification based on selective use of feature vectors
    Kwon, Soonil
    Narayanan, Shrikanth
    PATTERN RECOGNITION LETTERS, 2007, 28 (01) : 85 - 89
  • [23] Speaker Identification Using MFCC Feature Extraction ANN Classification Technique
    Singh, Mahesh K.
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 136 (01) : 453 - 467
  • [24] Robust analysis and weighting on MFCC components for speech recognition and speaker identification
    Zhou, Xi
    Fu, Yun
    Liu, Ming
    Hasegawa-Johnson, Mark
    Huang, Thomas S.
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 188 - 191
  • [25] Robust Automatic Speaker Identification System Using Shuffled MFCC Features
    Barhoush, Mahdi
    Hallawa, Ahmed
    Schmeink, Anke
    2021 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLIED NETWORK TECHNOLOGIES (ICMLANT II), 2021, : 28 - 33
  • [26] Speaker identification based on combination of MFCC and UMRT based features
    Antony, Anett
    Gopikakumari, R.
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 250 - 257
  • [27] MFCc-based feature extraction model for long time period emotion speech using cnn
    Alhlffee M.
    Revue d'Intelligence Artificielle, 2020, 34 (02): : 117 - 123
  • [28] MFCC-based descriptor for bee queen presence detection
    Soares, Bianca Sousa
    Luz, Jederson Sousa
    de Macedo, Valderlandia Francisca
    Veloso e Silva, Romuere Rodrigues
    Duarte de Araujo, Flavio Henrique
    Vieira Magalhaes, Deborah Maria
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 201
  • [29] An MFCC-based Secure Framework for Voice Assistant Systems
    Ahmed, Syed Fahad
    Jaffari, Rabeea
    Ahmed, Syed Saad
    Jawaid, Moazzam
    Talpur, Shahnawaz
    2022 INTERNATIONAL CONFERENCE ON CYBER WARFARE AND SECURITY (ICCWS), 2022, : 57 - 61
  • [30] Improved Multitaper PNCC Feature for Robust Speaker Verification
    Liu, Yi
    He, Liang
    Liu, Jia
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 168 - 172