Speaker recognition under stressed condition

Cited by: 23

Authors:

Senthil Raja G. [1]

Dandapat S. [1]

Affiliation:

[1] IIT, Guwahati

Source:

International Journal of Speech Technology | 2010, Vol. 13, No. 3

Keywords:

Speaker recognition; Stress compensation; Stressed speech

DOI:

10.1007/s10772-010-9075-z

Abstract:

This paper presents a feature analysis and the design of compensators for speaker recognition under stressed speech conditions. Any condition that causes a speaker to vary his or her speech production from the normal or neutral condition is called a stressed speech condition. Stressed speech is induced by emotion, high workload, sleep deprivation, frustration and environmental noise. Under stress, the characteristics of the speech signal differ from those of the normal or neutral condition, and as a result the performance of a speaker recognition system may degrade. First, six speech features that are widely used for speaker recognition are analyzed to evaluate their characteristics under stressed conditions: mel-frequency cepstral coefficients (MFCC), linear prediction (LP) coefficients, linear prediction cepstral coefficients (LPCC), reflection coefficients (RC), arc-sin reflection coefficients (ARC) and log-area ratios (LAR). Second, a Vector Quantization (VQ) classifier and a Gaussian Mixture Model (GMM) are used to evaluate speaker recognition results with the different speech features. This analysis helps select the best feature set for speaker recognition under stressed conditions. Finally, four novel VQ-based compensation techniques are proposed and evaluated for improving speaker recognition under stressed conditions: speaker and stressed information based compensation (SSIC), compensation by removal of stressed vectors (CRSV), cepstral mean normalization (CMN) and combination of MFCC and sinusoidal amplitude (CMSA) features. Speech data from the SUSAS database corresponding to four speaking conditions (Angry, Lombard, Question and Neutral) are used for the analysis. © 2010 Springer Science+Business Media, LLC.
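As a rough illustration of two building blocks the abstract names, cepstral mean normalization and a VQ classifier, the sketch below runs them on synthetic data. It is not the authors' implementation and does not cover SSIC, CRSV or CMSA. Everything in it is an assumption made for illustration: the function names, the use of plain k-means in place of whatever codebook training the paper uses, and the random 4-dimensional "features" standing in for MFCCs extracted from SUSAS recordings.

```python
import numpy as np

def cepstral_mean_normalize(features):
    """Subtract the per-utterance mean of each coefficient (frames x dims)."""
    return features - features.mean(axis=0, keepdims=True)

def train_codebook(features, size=2, iters=20, seed=0):
    """Train a VQ codebook with plain k-means (illustrative stand-in)."""
    rng = np.random.default_rng(seed)
    codebook = features[rng.choice(len(features), size, replace=False)].copy()
    for _ in range(iters):
        dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :], axis=2)
        nearest = dists.argmin(axis=1)
        for k in range(size):
            members = features[nearest == k]
            if len(members):  # leave an empty cell's codeword unchanged
                codebook[k] = members.mean(axis=0)
    return codebook

def average_distortion(features, codebook):
    """Mean distance from each frame to its nearest codeword."""
    dists = np.linalg.norm(features[:, None, :] - codebook[None, :, :], axis=2)
    return dists.min(axis=1).mean()

def identify(features, codebooks):
    """Return the speaker whose codebook gives the lowest average distortion."""
    return min(codebooks, key=lambda name: average_distortion(features, codebooks[name]))

def synthetic_speaker(rng, axis, n=200, dim=4, offset=0.0):
    """Hypothetical data: two clusters along one axis, plus a constant shift."""
    centers = np.zeros((2, dim))
    centers[0, axis], centers[1, axis] = 3.0, -3.0
    labels = np.arange(n) % 2
    return centers[labels] + 0.3 * rng.standard_normal((n, dim)) + offset

rng = np.random.default_rng(42)
codebooks = {
    "A": train_codebook(cepstral_mean_normalize(synthetic_speaker(rng, axis=0))),
    "B": train_codebook(cepstral_mean_normalize(synthetic_speaker(rng, axis=1))),
}

# Test utterances carry a constant offset, a crude proxy for a mismatch
# between training and test conditions; CMN removes it before scoring.
test_a = synthetic_speaker(rng, axis=0, offset=5.0)
test_b = synthetic_speaker(rng, axis=1, offset=-4.0)
result_a = identify(cepstral_mean_normalize(test_a), codebooks)
result_b = identify(cepstral_mean_normalize(test_b), codebooks)
print(result_a, result_b)
```

CMN only removes a constant shift in the feature vectors, so this toy setup says nothing about how well it handles the actual spectral changes in stressed speech; that is what the paper evaluates.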

Pages: 141-161

Page count: 20
