Experimental Evaluation of Features for Robust Speaker Identification

被引：145

作者：

Reynolds, Douglas A. ^{[1
]}

机构：

[1] MIT, Lincoln Lab, Lexington, MA 02173 USA

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1994年 / 2卷 / 04期

关键词：

Communication channels (information theory) - Database systems - Digital filters - Identification (control systems) - Iterative methods - Mathematical models - Matrix algebra - Robustness (control systems) - Speech analysis - Speech processing - Vectors;

D O I：

10.1109/89.326623

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This correspondence presents an experimental evaluation of different features and channel compensation techniques for robust speaker identification. The goal is to keep all processing and classification steps constant and to vary only the features and compensations used to allow a controlled comparison. A general, maximum-likelihood classifier based on Gaussian mixture densities is used as the classifier, and experiments are conducted on the King speech database, a conversational, telephone-speech database. The features examined are mel-frequency and linear-frequency filterbank cepstral coefficients, linear prediction cepstral coefficients, and perceptual linear prdiction (PLP) cepstral coefficients. The channel compensation techniques examined are cepstral mean removal, RASTA processing, and a quadratic trend removal technique. It is shown for this database that performance differences between the basic features is small, and the major gains are due to the channel compensation techniques. The best "across-the-divide" recognition accuracy of 92% is obtained for both high-order LPC features and band-limited filterbank features.

引用

页码：639 / 643

页数：5

共 50 条

[1] Fusion features for robust speaker identification
Ben Fredj, Ines
Zouhir, Youssef
Ouni, Kais
[J]. INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2018, 11 (02) : 65 - 72
[2] Robust Q Features for Speaker Identification
Deshpande, Mangesh S.
Holambe, Raghunath S.
[J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 209 - 213
[3] Bispectrum features for robust speaker identification
Wenndt, S
Shamsunder, S
[J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1095 - 1098
[4] Robust prosodic features for speaker identification
Carey, MJ
Parris, ES
LloydThomas, H
Bennett, S
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1800 - 1803
[5] Modulation Features for Noise Robust Speaker Identification
Mitra, Vikramjit
McLaren, Mitchel
Franco, Horacio
Graciarena, Martin
Scheffer, Nicolas
[J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3670 - 3674
[6] Robust Speaker Identification Incorporating High Frequency Features
Latha
[J]. TWELFTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2016 / TWELFTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2016 / TWELFTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2016, 2016, 89 : 804 - 811
[7] Robust lip-motion features for speaker identification
Çetingül, HE
Yemez, Y
Erzin, E
Tekalp, AM
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 509 - 512
[8] Speaker Identification based on Robust AM-FM Features
Deshpande, Mangesh S.
Holambe, Raghunath S.
[J]. 2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 62 - +
[9] Robust speech features based on wavelet transform with application to speaker identification
Hsieh, CT
Lai, E
Wang, YC
[J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2002, 149 (02): : 108 - 114
[10] Robust FHPD Features from Speech Harmonic Analysis for Speaker Identification
Wang, Shuiping
Tang, Zhenmin
Jiang, Ye
Chen, Ying
[J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (04): : 1591 - 1598

← 1 2 3 4 5 →