Spectral Restoration Based Speech Enhancement for Robust Speaker Identification

被引：5

作者：

Saleem, Nasir ^{[1
]}

Tareen, Tayyaba Gul ^{[2
]}

机构：

[1] Gomal Univ, Dept Elect Engn, Dera Ismail Khan, Pakistan

[2] Iqra Univ, Dept Elect Engn, Peshawar, Pakistan

来源：

INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE | 2018年 / 5卷 / 01期

关键词：

A Priori SNR; Spectral Restoration; Speech Enhancement; Speaker Identification; Mel Frequency Cepstral Coefficients; Vector Quantization; SUBSPACE APPROACH;

D O I：

10.9781/ijimai.2018.01.002

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Spectral restoration based speech enhancement algorithms are used to enhance quality of noise masked speech for robust speaker identification. In presence of background noise, the performance of speaker identification systems can be severely deteriorated. The present study employed and evaluated the Minimum Mean-Square-Error Short-Time Spectral Amplitude Estimators with modified a priori SNR estimate prior to speaker identification to improve performance of the speaker identification systems in presence of background noise. For speaker identification, Mel Frequency Cepstral coefficient and Vector Quantization is used to extract the speech features and to model the extracted features respectively. The experimental results showed significant improvement in speaker identification rates when spectral restoration based speech enhancement algorithms are used as a pre-processing step. The identification rates are found to be higher after employing the speech enhancement algorithms.

引用

页码：34 / 39

页数：6

共 50 条

[31] Robust several-speaker speech recognition with highly dependable online speaker adaptation and identification
Shih, Po-Yi
Lin, Po-Chuan
Wang, Jhing-Fa
Lin, Yuan-Ning
[J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2011, 34 (05) : 1459 - 1467
[32] Robust analysis and weighting on MFCC components for speech recognition and speaker identification
Zhou, Xi
Fu, Yun
Liu, Ming
Hasegawa-Johnson, Mark
Huang, Thomas S.
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 188 - 191
[33] CASA-Based Robust Speaker Identification
Zhao, Xiaojia
Shao, Yang
Wang, DeLiang
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (05): : 1608 - 1616
[34] Robust FHPD Features from Speech Harmonic Analysis for Speaker Identification
Wang, Shuiping
Tang, Zhenmin
Jiang, Ye
Chen, Ying
[J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (04): : 1591 - 1598
[35] Robust Speaker Identification Based on Binaural Masks
Ghalamiosgouei, Sina
Geravanchizadeh, Masoud
[J]. SPEECH COMMUNICATION, 2021, 132 (132) : 1 - 9
[36] Forensic speaker identification based on spectral moments
Rodman, R
McAllister, D
Bitzer, D
Cepeda, L
Abbitt, P
[J]. FORENSIC LINGUISTICS-THE INTERNATIONAL JOURNAL OF SPEECH LANGUAGE AND THE LAW, 2002, 9 (01): : 22 - 43
[37] Joint Speech Enhancement and Speaker Identification Using Monte Carlo Methods
Maina, Ciira Wa
Walsh, John MacLaren
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1359 - 1362
[38] Joint Speech Enhancement and Speaker Identification Using Approximate Bayesian Inference
Maina, Ciira Wa
Walsh, John MacLaren
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1517 - 1529
[39] Application of Slope Filtering to Robust Spectral Envelope Extraction for Speech/Speaker Recognition
Drgas, Szymon
Dabrowski, Adam
[J]. HUMAN LANGUAGE TECHNOLOGY: CHALLENGES OF THE INFORMATION SOCIETY, 2009, 5603 : 13 - 23
[40] Recognizing the message and the messenger: Biomimetic spectral analysis for robust speech and speaker recognition
Nemala S.K.
Patil K.
Elhilali M.
[J]. International Journal of Speech Technology, 2013, 16 (03) : 313 - 322

← 1 2 3 4 5 →