TEO-based speaker stress assessment using hybrid classification and tracking schemes

被引:4
|
作者
Hansen, John H.L. [1 ]
Ruzanski, Evan [1 ]
Bořil, Hynek [1 ]
Meyerhoff, James [1 ]
机构
[1] Center for Robust Speech Systems (CRSS), University of Texas at Dallas, 800 West Campbell Rd, EC33, Richardson, TX 75080-3021, United States
关键词
Stress assessment from speech; FLETC Corpus; TEO operator;
D O I
10.1007/s10772-012-9165-1
中图分类号
学科分类号
摘要
Speaker variability is known to have an adverse impact on speech systems that process linguistic content, such as speech and language recognition. However, speech production changes in individuals due to stress and emotions have similarly detrimental effect also on the task of speaker recognition as they introduce mismatch with the speaker models typically trained on modal speech. The focus of this study is on the analysis of stress-induced variations in speech and design of an automatic stress level assessment scheme that could be used in directing stress-dependent acoustic models or normalization strategies. Current stress detection methods typically employ a binary decision based on whether the speaker is or not under stress. In reality, the amount of stress in individuals varies and can change gradually. Using speech and biometric data collected in a real-world, variable-stress level law enforcement training scenario, this study considers two methods for stress level assessment. The first approach uses a nearest neighbor clustering scheme at the vowel token and sentence levels to classify speech data into three levels of stress. The second approach employs Euclidean distance metrics within the multi-dimensional feature space to provide real-time stress level tracking capability. Evaluations on audio data confirmed by biometric readings show both methods to be effective in assessment of stress level within a speaker (average accuracy of 55.6 % in a 3-way classification task). In addition, an impact of high-level stress on in-set speaker recognition is evaluated and shown to reduce the accuracy from 91.7 % (low/mid stress) to 21.4 % (high level stress).
引用
收藏
页码:295 / 311
页数:16
相关论文
共 50 条
  • [31] Stress Classification Using Artificial Neural Networks and Fatigue Life Assessment
    Jung, Sung-Wook
    Chang, Yoon-Suk
    Choi, Jae-Boong
    Kim, Young-Jin
    TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS A, 2006, 30 (05) : 520 - 527
  • [32] Multi-target Tracking Using a PHD-based Joint Tracking and Classification Algorithm
    Magnant, Clement
    Giremus, Audrey
    Grivel, Eric
    Ratton, Laurent
    Joseph, Bernard
    2016 IEEE RADAR CONFERENCE (RADARCONF), 2016, : 132 - 137
  • [33] USING WEIGHT-OF-EVIDENCE CLASSIFICATION SCHEMES IN THE ASSESSMENT OF NON-CANCER HEALTH RISKS
    PERLIN, SA
    MCCORMACK, C
    PROCEEDINGS OS THE 5TH NATIONAL CONFERENCE ON HAZARDOUS WASTES AND HAZARDOUS MATERIALS - HWHM 88, 1988, : 482 - 486
  • [34] Speaker-invariant Psychological Stress Detection Using Attention-based Network
    Shin, Hyeon-Kyeong
    Han, Hyewon
    Byun, Kyungguen
    Kang, Hong-Goo
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 308 - 313
  • [35] Semantic video object tracking using region-based classification
    Gu, C
    Lee, MC
    1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 3, 1998, : 643 - 647
  • [36] SPEAKER ACCENT CLASSIFICATION SYSTEM USING FUZZY CANONICAL CORRELATION-BASED GAUSSIAN CLASSIFIER
    Ullah, Sameeh
    Karray, Fakhri
    ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 792 - 795
  • [37] ONLINE UNSUPERVISED OVERLAPPING SPEAKER DETECTION USING ENHANCED CLASSIFICATION HISTORY-BASED FEATURES
    Oualil, Youssef
    Toroghi, Rahil Mahdian
    Klakow, Dietrich
    2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 228 - 232
  • [38] Multiple Object Classification Using Hybrid Saliency Based Descriptors
    Jalilvand, Ali
    Charkari, Nasrollah Moghadam
    KNOWLEDGE TECHNOLOGY, 2012, 295 : 348 - 351
  • [39] Classification of flying object based on radar data using hybrid
    Mandal, Priti
    Roy, Lakshi Prosad
    Das, Santos Kumar
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 107
  • [40] Intrusion detection using neural based hybrid classification methods
    Govindarajan, M.
    Chandrasekaran, R. M.
    COMPUTER NETWORKS, 2011, 55 (08) : 1662 - 1671