Improved MFCC-Based Feature for Robust Speaker Identification

被引:11
|
作者
吴尊敬
曹志刚
机构
[1] China
[2] Department of Electronic Engineering
[3] Tsinghua University
[4] State Key Laboratory on Microwave and Digital Communications
[5] Beijing 100084
基金
中国国家自然科学基金;
关键词
Mel-frequency cepstral coefficient (MFCC); robust speaker identification; feature extraction;
D O I
暂无
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
The Mel-frequency cepstral coefficient (MFCC) is the most widely used feature in speech and speaker recognition. However, MFCC is very sensitive to noise interference, which tends to drastically de- grade the performance of recognition systems because of the mismatches between training and testing. In this paper, the logarithmic transformation in the standard MFCC analysis is replaced by a combined function to improve the noisy sensitivity. The proposed feature extraction process is also combined with speech en- hancement methods, such as spectral subtraction and median-filter to further suppress the noise. Experi- ments show that the proposed robust MFCC-based feature significantly reduces the recognition error rate over a wide signal-to-noise ratio range.
引用
收藏
页码:158 / 161
页数:4
相关论文
共 50 条
  • [1] An MFCC-based Speaker Identification System
    Leu, Fang-Yie
    Lin, Guan-Liang
    2017 IEEE 31ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2017, : 1055 - 1062
  • [2] Hardware Implementation of MFCC-Based Feature Extraction for Speaker Recognition
    Ehkan, P.
    Zakaria, F. F.
    Warip, M. N. M.
    Sauli, Z.
    Elshaikh, M.
    ADVANCED COMPUTER AND COMMUNICATION ENGINEERING TECHNOLOGY, 2015, 315 : 471 - 480
  • [3] Evaluating MFCC-based speaker identification systems with data envelopment analysis
    Ozcan, Zubeyir
    Kayikcioglu, Temel
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 168
  • [4] An MFCC-based text-independent speaker identification system for access control
    Liu, Jung-Chun
    Leu, Fang-Yie
    Lin, Guan-Liang
    Susanto, Heru
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (02):
  • [5] Detection of Epilepsy Using MFCC-based Feature and XGBoost
    Long, Jie-min
    Yan, Zhang-fa
    Shen, Yu-lin
    Liu, Wei-jun
    Wei, Qing-yang
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [6] Accuracy of MFCC-Based Speaker Recognition in Series 60 Device
    Juhani Saastamoinen
    Evgeny Karpov
    Ville Hautamäki
    Pasi Fränti
    EURASIP Journal on Advances in Signal Processing, 2005
  • [7] Accuracy of MFCC-based speaker recognition in Series 60 device
    Saastamoinen, J
    Karpov, E
    Hautamäki, V
    Fränti, P
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (17) : 2816 - 2827
  • [8] Searching for a robust MFCC-based parameterization for ASR application
    Psutka, J. V.
    Smidl, Lubos
    Prazak, Ales
    SIGMAP 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2007, : 196 - +
  • [9] Multitaper Based MFCC Feature Extraction for Robust Speaker Recognition System
    Bharath, K. P.
    Kumar, Rajesh M.
    2019 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2019,
  • [10] Speech emotion recognition using MFCC-based entropy feature
    Siba Prasad Mishra
    Pankaj Warule
    Suman Deb
    Signal, Image and Video Processing, 2024, 18 : 153 - 161