Improved MFCC-Based Feature for Robust Speaker Identification

被引：11

作者：

吴尊敬

曹志刚

机构：

[1] China

[2] Department of Electronic Engineering

[3] Tsinghua University

[4] State Key Laboratory on Microwave and Digital Communications

[5] Beijing 100084

来源：

Tsinghua Science and Technology | 2005年 / 02期

基金：

中国国家自然科学基金;

关键词：

Mel-frequency cepstral coefficient (MFCC); robust speaker identification; feature extraction;

D O I：

暂无

中图分类号：

TN912.3 [语音信号处理];

学科分类号：

0711 ;

摘要：

The Mel-frequency cepstral coefficient (MFCC) is the most widely used feature in speech and speaker recognition. However, MFCC is very sensitive to noise interference, which tends to drastically de- grade the performance of recognition systems because of the mismatches between training and testing. In this paper, the logarithmic transformation in the standard MFCC analysis is replaced by a combined function to improve the noisy sensitivity. The proposed feature extraction process is also combined with speech en- hancement methods, such as spectral subtraction and median-filter to further suppress the noise. Experi- ments show that the proposed robust MFCC-based feature significantly reduces the recognition error rate over a wide signal-to-noise ratio range.

引用

页码：158 / 161

页数：4

共 50 条

[31] An Improved Ranking-Based Feature Enhancement Approach for Robust Speaker Recognition
Yan, Furong
Men, Aidong
Yang, Bo
Jiang, Zhuqing
IEEE ACCESS, 2016, 4 : 5258 - 5267
[32] Cardiac sound classification using a hybrid approach: MFCC-based feature fusion and CNN deep features
Mahbubeh Bahreini
Ramin Barati
Abbas Kamali
EURASIP Journal on Advances in Signal Processing, 2025 (1)
[33] MISSING FEATURE RECONSTRUCTION METHODS FOR ROBUST SPEAKER IDENTIFICATION
Zhang, Xueliang
Zhang, Hui
Gao, Guanglai
2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1482 - 1486
[34] Incorporating auditory feature uncertainties in robust speaker identification
Shao, Yang
Srinivasan, Soundararajan
Wang, DeLiang
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 277 - +
[35] Acoustic feature extraction method for robust speaker identification
Zuoqiang Li
Yong Gao
Multimedia Tools and Applications, 2016, 75 : 7391 - 7406
[36] A Feature Level Fusion Scheme for Robust Speaker Identification
Sekkate, Sara
Khalil, Mohammed
Adib, Abdellah
BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 289 - 300
[37] Spectral-temporal receptive fields and MFCC balanced feature extraction for robust speaker recognition
Wang, Jia-Ching
Wang, Chien-Yao
Chin, Yu-Hao
Liu, Yu-Ting
Chen, En-Ting
Chang, Pao-Chi
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (03) : 4055 - 4068
[38] Spectral-temporal receptive fields and MFCC balanced feature extraction for robust speaker recognition
Jia-Ching Wang
Chien-Yao Wang
Yu-Hao Chin
Yu-Ting Liu
En-Ting Chen
Pao-Chi Chang
Multimedia Tools and Applications, 2017, 76 : 4055 - 4068
[39] ROBUST FEATURE FRONT-END FOR SPEAKER IDENTIFICATION
Liu, Gang
Lei, Yun
Hansen, John H. L.
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4233 - 4236
[40] Acoustic feature extraction method for robust speaker identification
Li, Zuoqiang
Gao, Yong
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (12) : 7391 - 7406

← 1 2 3 4 5 →