ROBUST SPEAKER IDENTIFICATION USING AN AUDITORY-BASED FEATURE

Cited by: 28
Authors
Li, Qi [1]
Huang, Yan [1]
Affiliations
[1] Li Creat Technol LcT Inc, Florham Pk, NJ 07932 USA
Keywords
Speech feature extraction; auditory-based feature; robust speaker recognition; speaker identification; cochlea; FILTER SHAPES; NOISE;
DOI
10.1109/ICASSP.2010.5495589
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification code
070206; 082403
Abstract
An auditory-based feature extraction algorithm is presented. The feature is based on a recently published time-frequency transform plus a set of modules that simulate the signal processing functions of the cochlea. The feature is applied to a speaker identification task to address the acoustic mismatch between training and testing conditions. Typically, the performance of acoustic models trained on clean speech drops significantly when they are tested on noisy speech. The proposed feature shows strong robustness in this mismatched situation. In our speaker identification experiments, both MFCC and the proposed feature perform nearly perfectly under clean testing conditions, but when the SNR of the input signal drops to 6 dB, the average accuracy of the MFCC feature is only 41.2%, while the proposed feature still achieves an average accuracy of 88.3%.
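The mismatched test condition mentioned in the abstract (clean training, testing at a fixed SNR such as 6 dB) can be illustrated with a minimal sketch of mixing noise into a signal at a target SNR. This is not the authors' procedure: the function names `mix_at_snr` and `measured_snr_db`, the sine-wave stand-in for speech, and the white Gaussian noise are all assumptions made for illustration, and the abstract does not state which noise types were used.

```python
import numpy as np


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so the speech-to-noise power ratio equals `snr_db`, then add it.

    Hypothetical helper: assumes `noise` is at least as long as `speech`
    and that both signals have non-zero power.
    """
    speech = np.asarray(speech, dtype=float)
    noise = np.asarray(noise, dtype=float)[: len(speech)]
    p_speech = np.mean(speech ** 2)
    p_noise = np.mean(noise ** 2)
    # Target noise power is P_speech / 10^(SNR/10); take the square root
    # of the power ratio to get an amplitude gain.
    gain = np.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10)))
    scaled_noise = gain * noise
    return speech + scaled_noise, scaled_noise


def measured_snr_db(speech, scaled_noise):
    """Return the SNR in dB between a signal and the noise added to it."""
    return 10 * np.log10(np.mean(speech ** 2) / np.mean(scaled_noise ** 2))


rng = np.random.default_rng(0)
fs = 16000  # assumed sampling rate in Hz
speech = np.sin(2 * np.pi * 220 * np.arange(fs) / fs)  # stand-in for a speech signal
noise = rng.standard_normal(fs)  # stand-in for a recorded noise signal

noisy, scaled_noise = mix_at_snr(speech, noise, snr_db=6.0)
print(round(measured_snr_db(speech, scaled_noise), 3))
```

In an evaluation of this kind, features would be extracted from `noisy` and scored against models trained on clean speech only, so the gap between clean and noisy accuracy reflects the training/testing mismatch.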
Pages: 4514-4517
Page count: 4
Related papers
50 in total
  • [1] An Auditory-Based Feature Extraction Algorithm for Robust Speaker Identification Under Mismatched Conditions
    Li, Qi
    Huang, Yan
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1791 - 1801
  • [2] AN AUDITORY-BASED FEATURE FOR ROBUST SPEECH RECOGNITION
    Shao, Yang
    Jin, Zhaozhang
    Wang, DeLiang
    Srinivasan, Soundararajan
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009, : 4625 - +
  • [3] Robust Auditory-Based Speech Feature Extraction Using Independent Subspace Method
    Wu, Qiang
    Zhang, Liqing
    Xia, Bin
    [J]. ADVANCES IN COGNITIVE NEURODYNAMICS, PROCEEDINGS, 2008, : 405 - +
  • [4] Incorporating auditory feature uncertainties in robust speaker identification
    Shao, Yang
    Srinivasan, Soundararajan
    Wang, DeLiang
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 277 - +
  • [5] Robust speaker identification using auditory features and computational auditory scene analysis
    Shao, Yang
    Wang, DeLiang
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 1589 - 1592
  • [6] Improved robust speaker identification in noise using auditory properties
    Hu, GR
    Wei, XD
    [J]. PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 17 - 19
  • [7] Robust classification of stop consonants using auditory-based speech processing
    Ali, AMA
    Van der Spiegel, J
    Mueller, P
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 81 - 84
  • [8] Robust auditory-based speech processing using the average localized synchrony detection
    Ali, AMA
    Van der Spiegel, J
    Mueller, P
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (05): : 279 - 292
  • [9] Robust speaker identification based on selective use of feature vectors
    Kwon, Soonil
    Narayanan, Shrikanth
    [J]. PATTERN RECOGNITION LETTERS, 2007, 28 (01) : 85 - 89
  • [10] Improved MFCC-Based Feature for Robust Speaker Identification
    吴尊敬
    曹志刚
    [J]. Tsinghua Science and Technology, 2005, (02) : 158 - 161