Stress detection using non-semantic speech representation

被引:6
|
作者
Kejriwal, Jay [1 ,2 ]
Benus, Stefan [1 ,3 ]
Trnka, Marian [1 ]
机构
[1] Slovak Acad Sci, Inst Informat, Bratislava, Slovakia
[2] Slovak Tech Univ, Fac Informat & Informat Technol, Bratislava, Slovakia
[3] Constantine Philosopher Univ, Nitra, Slovakia
关键词
stress detection; speech; classification; x-vectors; TRILL vector; MFCC feature; PLP feature; LLD feature;
D O I
10.1109/RADIOELEKTRONIKA54537.2022.9764916
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In today's world, stress has become a prominent cause for many ailments. Automatic detection of stress from speech using state-of-the-art machine learning algorithms can facilitate early detection and prevention of stress. Artificial intelligence agents involved in affective computing and human-machine spoken interaction (HMI) might benefit from the capacity to identify human stress automatically. Despite the fact that several different methods have been established for stress detection, it is still unclear which auditory features should be considered for training a deep neural network (DNN) model. In this study, we propose to investigate the performance of traditional and modern auditory features for stress classification using the StressDat database. The StressDat database is a collection of acted speech recordings in Slovak realizing sentences within stress-prone situations in three different levels of stress. The performance of traditional auditory features such as Mel-Frequency Cepstral Coefficients (MFCC) and Perceptual Linear Prediction (PLP) are compared with modern auditory non-semantic speech representation such as x-vectors and TRIpLet Loss network (TRILL) vectors. As a benchmark, Low-level descriptors (LLD) auditory features are extracted using the OpenSMILE toolkit. We evaluated performance of four different automatic classification algorithms: support vector machine (SVM), multilayer perceptron (MLP), convolutional neural network (CNN), and long shortterm memory (LSTM). The results reveal that TRILL vectors trained on CNN provide the highest accuracy (81.86%).
引用
收藏
页码:133 / 137
页数:5
相关论文
共 50 条
  • [41] ADULT AGE-DIFFERENCES IN RECOGNITION MEMORY FOR A NON-SEMANTIC ATTRIBUTE
    KAUSLER, DH
    PUCKETT, JM
    EXPERIMENTAL AGING RESEARCH, 1980, 6 (04) : 349 - 355
  • [42] INCIDENTAL-LEARNING OF ASSOCIATIONS DURING SEMANTIC AND NON-SEMANTIC PROCESSING - IS CONTIGUITY A SUFFICIENT FACTOR
    ZECHMEISTER, EB
    CURT, C
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1976, 8 (04) : 246 - 246
  • [43] Functional Dissociations of the Left Anterior and Posterior Occipitotemporal Cortex for Semantic and Non-semantic Phonological Access
    Dong, Jie
    Lu, Chengrou
    Chen, Chuansheng
    Li, Huiling
    Liu, Xiaoyu
    Mei, Leilei
    NEUROSCIENCE, 2020, 430 : 94 - 104
  • [44] Preserved reading aloud with semantic deficits: Evidence for a non-semantic lexical route for reading Chinese
    Law, SP
    Wong, WS
    Chiu, KMY
    NEUROCASE, 2005, 11 (03) : 167 - 175
  • [45] STRUCTURE OF MEMORY TRACES FOLLOWING SEMANTIC AND NON-SEMANTIC ORIENTATION TASKS IN INCIDENTAL-LEARNING
    ARBUCKLE, TY
    KATZ, WA
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN LEARNING AND MEMORY, 1976, 2 (04): : 362 - 369
  • [46] Fodor on concepts and Frege puzzles (Mode of presentation, non-semantic solutions)
    Aydede, M
    PACIFIC PHILOSOPHICAL QUARTERLY, 1998, 79 (04): : 289 - 294
  • [47] SEMANTIC AND NON-SEMANTIC INPUT AND OUTPUT PROCESSING AND CONTEXT EFFECTS IN IMMEDIATE AND DELAYED RECOGNITION MEMORY
    SCHWARZ, W
    BATTIG, WF
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1976, 8 (04) : 258 - 258
  • [48] Aggression Detection in Speech Using Sensor and Semantic Information
    Lefter, Iulia
    Rothkrantz, Leon J. M.
    Burghouts, Gertjan J.
    TEXT, SPEECH AND DIALOGUE, TSD 2012, 2012, 7499 : 665 - 672
  • [49] Stress Recognition using Sparse Representation of Speech Signal for Deception Detection Applications in Indian Context
    Varsha, Aswathi K. T. K.
    Lalitha, S.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2017, : 60 - 66
  • [50] A knowledge extraction process specification for today's non-semantic web
    Arjona, JL
    Corchuelo, R
    Toro, M
    IEEE/WIC INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2003, : 61 - 67