Reducing the environmental sensitivity of cepstral features for speaker recognition

被引：0

作者：

Openshaw, JP

Mason, JS

机构：

来源：

ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II | 1996年

关键词：

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper investigates the robustness of cepstral based features with respect to additive noise, and details two methods of increasing the robustness with minimal need for a-prioro knowledge of the noise statistics. The first approach is a form of noise masking which adds a fixed offset to the linear spectral estimate. The second is a form of sub-land filtering, again in the linear domain, which estimates the dynamic content of the speech using Fourier transforms. This avoids negative values normally inherent in such filtering and which presents difficulties in deriving log estimates. Both methods are shown to provide useful levels of robustness to additive noise, for example, speaker identification error rates in SNR mis-matched conditions of 15 dB are reduced from 60.5% for standard mel cepstra to 13.8% and 24.1% for the two approaches respectively, a relative reduction in error of 77% and 60.1%.

引用

页码：721 / 724

页数：4

共 50 条

[41] Cepstral Features and Text-Dependent Speaker Identification A Comparative Study
Ouzounov, Atanas
[J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2010, 10 (01) : 3 - 12
[42] SPEAKER RECOGNITION BY STATISTICAL FEATURES AND DYNAMIC FEATURES
FURUI, S
[J]. REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1982, 30 (03): : 467 - 482
[43] Cepstral and Long-Term Features for Emotion Recognition
Dumouchel, Pierre
Dehak, Najim
Attabi, Yazid
Dehak, Reda
Boufaden, Narjes
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 344 - +
[44] Evaluation of Lineal Relation between Shifted Delta Cepstral Features and Prosodic Features in Speaker Verification
Calvo, Jose R.
Ribas, Dayana
Fernandez, Rafael
Hernandez, Gabriel
[J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2008, 5197 : 112 - 119
[45] Robust Speech Recognition Combining Cepstral and Articulatory Features
Zha, Zhuan-ling
Hu, Jin
Zhan, Qing-ran
Shan, Ya-hui
Xie, Xiang
Wang, Jing
Cheng, Hao-bo
[J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1401 - 1405
[46] SPEAKER RECOGNITION USING SYLLABLE-BASED CONSTRAINTS FOR CEPSTRAL FRAME SELECTION
Bocklet, Tobias
Shriberg, Elizabeth
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4525 - +
[47] The wavelet packet based cepstral features for open set speaker classification in Marathi
Patil, HA
Dutta, PK
Basu, TK
[J]. FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 134 - +
[48] Speaker Recognition Using Mel Frequency Cepstral Coefficient and Locality Sensitive Hashing
Awais, Ahmed
Kun, She
Yu, Yue
Hayat, Shaukat
Ahmed, Aftab
Tu, Tianyi
[J]. 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD), 2018, : 271 - 276
[49] The use of harmonic features in speaker recognition
Imperl, B
Kacic, Z
Horvat, B
[J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1131 - 1134
[50] A study of harmonic features for the speaker recognition
Imperl, B
Kacic, Z
Horvat, B
[J]. SPEECH COMMUNICATION, 1997, 22 (04) : 385 - 402

← 1 2 3 4 5 →