Spectral Restoration Based Speech Enhancement for Robust Speaker Identification

被引:5
|
作者
Saleem, Nasir [1 ]
Tareen, Tayyaba Gul [2 ]
机构
[1] Gomal Univ, Dept Elect Engn, Dera Ismail Khan, Pakistan
[2] Iqra Univ, Dept Elect Engn, Peshawar, Pakistan
关键词
A Priori SNR; Spectral Restoration; Speech Enhancement; Speaker Identification; Mel Frequency Cepstral Coefficients; Vector Quantization; SUBSPACE APPROACH;
D O I
10.9781/ijimai.2018.01.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spectral restoration based speech enhancement algorithms are used to enhance quality of noise masked speech for robust speaker identification. In presence of background noise, the performance of speaker identification systems can be severely deteriorated. The present study employed and evaluated the Minimum Mean-Square-Error Short-Time Spectral Amplitude Estimators with modified a priori SNR estimate prior to speaker identification to improve performance of the speaker identification systems in presence of background noise. For speaker identification, Mel Frequency Cepstral coefficient and Vector Quantization is used to extract the speech features and to model the extracted features respectively. The experimental results showed significant improvement in speaker identification rates when spectral restoration based speech enhancement algorithms are used as a pre-processing step. The identification rates are found to be higher after employing the speech enhancement algorithms.
引用
收藏
页码:34 / 39
页数:6
相关论文
共 50 条
  • [31] Robust several-speaker speech recognition with highly dependable online speaker adaptation and identification
    Shih, Po-Yi
    Lin, Po-Chuan
    Wang, Jhing-Fa
    Lin, Yuan-Ning
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2011, 34 (05) : 1459 - 1467
  • [32] Robust analysis and weighting on MFCC components for speech recognition and speaker identification
    Zhou, Xi
    Fu, Yun
    Liu, Ming
    Hasegawa-Johnson, Mark
    Huang, Thomas S.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 188 - 191
  • [33] CASA-Based Robust Speaker Identification
    Zhao, Xiaojia
    Shao, Yang
    Wang, DeLiang
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (05): : 1608 - 1616
  • [34] Robust FHPD Features from Speech Harmonic Analysis for Speaker Identification
    Wang, Shuiping
    Tang, Zhenmin
    Jiang, Ye
    Chen, Ying
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (04): : 1591 - 1598
  • [35] Robust Speaker Identification Based on Binaural Masks
    Ghalamiosgouei, Sina
    Geravanchizadeh, Masoud
    [J]. SPEECH COMMUNICATION, 2021, 132 (132) : 1 - 9
  • [36] Forensic speaker identification based on spectral moments
    Rodman, R
    McAllister, D
    Bitzer, D
    Cepeda, L
    Abbitt, P
    [J]. FORENSIC LINGUISTICS-THE INTERNATIONAL JOURNAL OF SPEECH LANGUAGE AND THE LAW, 2002, 9 (01): : 22 - 43
  • [37] Joint Speech Enhancement and Speaker Identification Using Monte Carlo Methods
    Maina, Ciira Wa
    Walsh, John MacLaren
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1359 - 1362
  • [38] Joint Speech Enhancement and Speaker Identification Using Approximate Bayesian Inference
    Maina, Ciira Wa
    Walsh, John MacLaren
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1517 - 1529
  • [39] Application of Slope Filtering to Robust Spectral Envelope Extraction for Speech/Speaker Recognition
    Drgas, Szymon
    Dabrowski, Adam
    [J]. HUMAN LANGUAGE TECHNOLOGY: CHALLENGES OF THE INFORMATION SOCIETY, 2009, 5603 : 13 - 23
  • [40] Recognizing the message and the messenger: Biomimetic spectral analysis for robust speech and speaker recognition
    Nemala S.K.
    Patil K.
    Elhilali M.
    [J]. International Journal of Speech Technology, 2013, 16 (03) : 313 - 322