Robust Speaker Recognition Using Improved GFCC and Adaptive Feature Selection

被引:0
|
作者
Zhang, Xingyu [1 ,2 ]
Zou, Xia [1 ,2 ]
Sun, Meng [1 ,2 ]
Wu, Penglong [1 ,2 ]
机构
[1] Army Engn Univ, Nanjing, Jiangsu, Peoples R China
[2] PLA Army Engn Univ, Lab Intelligent Informat Proc, Nanjing, Jiangsu, Peoples R China
关键词
Gammatone Frequency Cepstrum Coefficients (GFCC); i-vector; Robust speaker recognition; Mel-Frequency Cepstrum Coefficient (MFCC); Adaptive feature selection;
D O I
10.1007/978-3-030-16946-6_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speaker recognition systems have shown good performance in noise-free environments, but the performance will severely deteriorate in the presence of noises. At the front end of the systems, Mel-Frequency Cepstral Coefficient (MFCC), or a relatively noise-robust feature Gammatone Frequency Cepstral Coefficients (GFCC), is commonly used as time-frequency feature. To further improve the noise-robustness of GFCC, signal processing techniques, such as DC removal, pre-emphasis and Cepstral Mean Variance Normalization (CMVN), are investigated in the extraction of GFCC. Being aware the advantages and disadvantages of MFCC and GFCC, an adaptive strategy was proposed to make feature selection based on the quality of speech. Experiments were conducted on TIMIT dataset to evaluate our approach. Compared with ordinary GFCC and MFCC features, our method significantly reduced the EER in speech data with miscellaneous SNRs.
引用
收藏
页码:159 / 169
页数:11
相关论文
共 50 条
  • [21] Adaptive wavelet shrinkage for noise robust speaker recognition
    Govindan, Sumithra Manimegalai
    Duraisamy, Prakash
    Yuan, Xiaohui
    DIGITAL SIGNAL PROCESSING, 2014, 33 : 180 - 190
  • [22] Multistage face recognition using adaptive feature selection and classification
    Zuo, F
    de With, PHN
    van der Veen, M
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2005, 3708 : 17 - 25
  • [23] Improved Deep Speaker Feature Learning for Text-Dependent Speaker Recognition
    Li, Lantian
    Lin, Yiye
    Zhang, Zhiyong
    Wang, Dong
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 426 - 429
  • [24] Integrated Feature Normalization and Enhancement for robust Speaker Recognition using Acoustic Factor Analysis
    Hasan, Taufiq
    Hansen, John H. L.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1566 - 1569
  • [25] A Modified MFCC Feature Extraction Technique For Robust Speaker Recognition
    Sharma, Diksha
    Ali, Israj
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 1052 - 1057
  • [26] Feature Selection by Independent Component Analysis for Robust Speaker Verification
    Senturk, Ahmet
    Gurgen, Fikret S.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (3B): : 229 - 239
  • [27] Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives
    Cerva, Petr
    Silovsky, Jan
    Zdansky, Jindrich
    Nouza, Jan
    Seps, Ladislav
    SPEECH COMMUNICATION, 2013, 55 (10) : 1033 - 1046
  • [28] Improved speaker adaptation using speaker dependent feature projections
    Matsoukas, S
    Schwartz, R
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 273 - 278
  • [29] Improved MFCC-Based Feature for Robust Speaker Identification
    吴尊敬
    曹志刚
    Tsinghua Science and Technology, 2005, (02) : 158 - 161
  • [30] New Feature Vector based on GFCC for Language Recognition
    Chandrasekaram, B.
    JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (02) : 481 - 486