ROBUST SPEAKER IDENTIFICATION USING AN AUDITORY-BASED FEATURE

被引：28

作者：

Li, Qi ^{[1
]}

Huang, Yan ^{[1
]}

机构：

[1] Li Creat Technol LcT Inc, Florham Pk, NJ 07932 USA

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年

关键词：

Speech feature extraction; auditory-based feature; robust speaker recognition; speaker identification; cochlea; FILTER SHAPES; NOISE;

D O I：

10.1109/ICASSP.2010.5495589

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

An auditory-based feature extraction algorithm is presented. The feature is based on a recently published time-frequency transform plus a set of modules to simulate the signal processing functions in the cochlea. The feature is applied to a speaker identification task to address the acoustic mismatch problem between training and testing. Usually, the performances of acoustic models trained in clean speech drop significantly when tested on noisy speech. The proposed feature has shown strong robustness in the mismatched situation. As shown in our experiments, in a speaker identification task, both MFCC and the proposed feature have near perfect performances in a clean testing condition, but when the SNR of input signal drops to 6 dB, the average accuracy of the MFCC feature is only 41.2%, while the proposed feature still achieves an average accuracy of 88.3%.

引用

页码：4514 / 4517

页数：4

共 50 条

[1] An Auditory-Based Feature Extraction Algorithm for Robust Speaker Identification Under Mismatched Conditions
Li, Qi
Huang, Yan
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1791 - 1801
[2] AN AUDITORY-BASED FEATURE FOR ROBUST SPEECH RECOGNITION
Shao, Yang
Jin, Zhaozhang
Wang, DeLiang
Srinivasan, Soundararajan
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4625 - +
[3] Robust Auditory-Based Speech Feature Extraction Using Independent Subspace Method
Wu, Qiang
Zhang, Liqing
Xia, Bin
[J]. ADVANCES IN COGNITIVE NEURODYNAMICS, PROCEEDINGS, 2008, : 405 - +
[4] Incorporating auditory feature uncertainties in robust speaker identification
Shao, Yang
Srinivasan, Soundararajan
Wang, DeLiang
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 277 - +
[5] Robust speaker identification using auditory features and computational auditory scene analysis
Shao, Yang
Wang, DeLiang
[J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 1589 - 1592
[6] Improved robust speaker identification in noise using auditory properties
Hu, GR
Wei, XD
[J]. PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 17 - 19
[7] Robust classification of stop consonants using auditory-based speech processing
Ali, AMA
Van der Spiegel, J
Mueller, P
[J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 81 - 84
[8] Robust auditory-based speech processing using the average localized synchrony detection
Ali, AMA
Van der Spiegel, J
Mueller, P
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (05): : 279 - 292
[9] Robust speaker identification based on selective use of feature vectors
Kwon, Soonil
Narayanan, Shrikanth
[J]. PATTERN RECOGNITION LETTERS, 2007, 28 (01) : 85 - 89
[10] Improved MFCC-Based Feature for Robust Speaker Identification
吴尊敬
曹志刚
[J]. Tsinghua Science and Technology, 2005, (02) : 158 - 161

← 1 2 3 4 5 →