Robust Speaker Recognition Using Improved GFCC and Adaptive Feature Selection

被引:0
|
作者
Zhang, Xingyu [1 ,2 ]
Zou, Xia [1 ,2 ]
Sun, Meng [1 ,2 ]
Wu, Penglong [1 ,2 ]
机构
[1] Army Engn Univ, Nanjing, Jiangsu, Peoples R China
[2] PLA Army Engn Univ, Lab Intelligent Informat Proc, Nanjing, Jiangsu, Peoples R China
关键词
Gammatone Frequency Cepstrum Coefficients (GFCC); i-vector; Robust speaker recognition; Mel-Frequency Cepstrum Coefficient (MFCC); Adaptive feature selection;
D O I
10.1007/978-3-030-16946-6_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speaker recognition systems have shown good performance in noise-free environments, but the performance will severely deteriorate in the presence of noises. At the front end of the systems, Mel-Frequency Cepstral Coefficient (MFCC), or a relatively noise-robust feature Gammatone Frequency Cepstral Coefficients (GFCC), is commonly used as time-frequency feature. To further improve the noise-robustness of GFCC, signal processing techniques, such as DC removal, pre-emphasis and Cepstral Mean Variance Normalization (CMVN), are investigated in the extraction of GFCC. Being aware the advantages and disadvantages of MFCC and GFCC, an adaptive strategy was proposed to make feature selection based on the quality of speech. Experiments were conducted on TIMIT dataset to evaluate our approach. Compared with ordinary GFCC and MFCC features, our method significantly reduced the EER in speech data with miscellaneous SNRs.
引用
收藏
页码:159 / 169
页数:11
相关论文
共 50 条
  • [11] A robust feature based on sparse representation for speaker recognition
    Xie, Yining
    Huang, Jinjie
    Wang, Xinlei
    Journal of Computational Information Systems, 2013, 9 (09): : 3553 - 3561
  • [12] A COCHLEAR NEURON BASED ROBUST FEATURE FOR SPEAKER RECOGNITION
    You, Datao
    Jiang, Tao
    Han, Jiqing
    Zheng, Tieran
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5440 - 5443
  • [13] An Auditory Feature Extraction Method for Robust Speaker Recognition
    Hu, Fengsong
    Cao, Xiaoyu
    PROCEEDINGS OF 2012 IEEE 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, 2012, : 1067 - 1071
  • [14] Robust feature extraction from spectrum estimated using bispectrum for speaker recognition
    Ajmera P.K.
    Nehe N.S.
    Jadhav D.V.
    Holambe R.S.
    International Journal of Speech Technology, 2012, 15 (3) : 433 - 440
  • [15] Improved Multitaper PNCC Feature for Robust Speaker Verification
    Liu, Yi
    He, Liang
    Liu, Jia
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 168 - 172
  • [16] Emotion recognition using semi-supervised feature selection with speaker normalization
    Sun Y.
    Wen G.
    International Journal of Speech Technology, 2015, 18 (3) : 317 - 331
  • [17] Robust Character Recognition Using Adaptive Feature Extraction Method
    Mori, Minoru
    Sawaki, Minako
    Yamato, Junji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (01): : 125 - 133
  • [18] Wavelet feature selection using fuzzy approach to text independent speaker recognition
    Lung, SY
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (03) : 779 - 781
  • [19] A Robust Speaker Identification System Based on the Combination of GFCC and MFCC Methods
    Bachir Tazi, El
    PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2016, : 54 - 58
  • [20] DTW-based feature selection for speech recognition and speaker recognition
    Liu, Jing-Wei
    Xu, Mei-Zhi
    Zheng, Zhong-Guo
    Cheng, Qian-Sheng
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2005, 18 (01): : 50 - 54