Speaker recognition under stressed condition

被引:23
|
作者
Senthil Raja G. [1 ]
Dandapat S. [1 ]
机构
[1] IIT, Guwahati
关键词
Speaker recognition; Stress compensation; Stressed speech;
D O I
10.1007/s10772-010-9075-z
中图分类号
学科分类号
摘要
This paper presents the feature analysis and design of compensators for speaker recognition under stressed speech conditions. Any condition that causes a speaker to vary his or her speech production from normal or neutral condition is called stressed speech condition. Stressed speech is induced by emotion, high workload, sleep deprivation, frustration and environmental noise. In stressed condition, the characteristics of speech signal are different from that of normal or neutral condition. Due to changes in speech signal characteristics, performance of the speaker recognition system may degrade under stressed speech conditions. Firstly, six speech features (mel-frequency cepstral coefficients (MFCC), linear prediction (LP) coefficients, linear prediction cepstral coefficients (LPCC), reflection coefficients (RC), arc-sin reflection coefficients (ARC) and log-area ratios (LAR)), which are widely used for speaker recognition, are analyzed for evaluation of their characteristics under stressed condition. Secondly, Vector Quantization (VQ) classifier and Gaussian Mixture Model (GMM) are used to evaluate speaker recognition results with different speech features. This analysis help select the best feature set for speaker recognition under stressed condition. Finally, four VQ based novel compensation techniques are proposed and evaluated for improvement of speaker recognition under stressed condition. The compensation techniques are speaker and stressed information based compensation (SSIC), compensation by removal of stressed vectors (CRSV), cepstral mean normalization (CMN) and combination of MFCC and sinusoidal amplitude (CMSA) features. Speech data from SUSAS database corresponding to four different stressed conditions, Angry, Lombard, Question and Neutral, are used for analysis of speaker recognition under stressed condition. © 2010 Springer Science+Business Media, LLC.
引用
收藏
页码:141 / 161
页数:20
相关论文
共 50 条
  • [1] Speaker recognition under limited data condition by noise addition
    Krishnamoorthy, P.
    Jayanna, H. S.
    Prasanna, S. R. M.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 13487 - 13490
  • [2] An experimental comparison of modelling techniques for speaker recognition under limited data condition
    H. S. Jayanna
    S. R. Mahadeva Prasanna
    [J]. Sadhana, 2009, 34 : 717 - 728
  • [3] An experimental comparison of modelling techniques for speaker recognition under limited data condition
    Jayanna, H. S.
    Prasanna, S. R. Mahadeva
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2009, 34 (05): : 717 - 728
  • [4] Multiple frame size and rate analysis for speaker recognition under limited data condition
    Jayanna, H. S.
    Prasanna, S. R. Mahadeva
    [J]. IET SIGNAL PROCESSING, 2009, 3 (03) : 189 - 204
  • [5] Robust Speaker Recognition in Cross-channel Condition
    Shan, Yuxiang
    Liu, Jia
    [J]. PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4344 - 4348
  • [6] Improved Speaker Recognition System for Stressed Speech using Deep Neural Networks
    Dumpala, Sri Harsha
    Kopparapu, Sunil Kumar
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1257 - 1264
  • [7] Recursive Whitening Transformation for Speaker Recognition on Language Mismatched Condition
    Shon, Suwon
    Mun, Seongkyu
    Ko, Hanseok
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2869 - 2873
  • [8] Robust Spectral Features for Automatic Speaker Recognition in Mismatch Condition
    Chougule, Sharada V.
    Chavan, Mahesh S.
    [J]. SECOND INTERNATIONAL SYMPOSIUM ON COMPUTER VISION AND THE INTERNET (VISIONNET'15), 2015, 58 : 272 - 279
  • [9] Speaker Recognition For Speech Under Face Cover
    Saeidi, Rahim
    Niemi, Tuija
    Karppelin, Hanna
    Pohjalainen, Jouni
    Kinnunen, Tomi
    Alku, Paavo
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1012 - 1016
  • [10] Speaker verification under degraded condition: A perceptual study
    Pradhan G.
    Mahadeva Prasanna S.R.
    [J]. International Journal of Speech Technology, 2011, 14 (4) : 405 - 417