Stress detection using non-semantic speech representation

被引:6
|
作者
Kejriwal, Jay [1 ,2 ]
Benus, Stefan [1 ,3 ]
Trnka, Marian [1 ]
机构
[1] Slovak Acad Sci, Inst Informat, Bratislava, Slovakia
[2] Slovak Tech Univ, Fac Informat & Informat Technol, Bratislava, Slovakia
[3] Constantine Philosopher Univ, Nitra, Slovakia
关键词
stress detection; speech; classification; x-vectors; TRILL vector; MFCC feature; PLP feature; LLD feature;
D O I
10.1109/RADIOELEKTRONIKA54537.2022.9764916
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In today's world, stress has become a prominent cause for many ailments. Automatic detection of stress from speech using state-of-the-art machine learning algorithms can facilitate early detection and prevention of stress. Artificial intelligence agents involved in affective computing and human-machine spoken interaction (HMI) might benefit from the capacity to identify human stress automatically. Despite the fact that several different methods have been established for stress detection, it is still unclear which auditory features should be considered for training a deep neural network (DNN) model. In this study, we propose to investigate the performance of traditional and modern auditory features for stress classification using the StressDat database. The StressDat database is a collection of acted speech recordings in Slovak realizing sentences within stress-prone situations in three different levels of stress. The performance of traditional auditory features such as Mel-Frequency Cepstral Coefficients (MFCC) and Perceptual Linear Prediction (PLP) are compared with modern auditory non-semantic speech representation such as x-vectors and TRIpLet Loss network (TRILL) vectors. As a benchmark, Low-level descriptors (LLD) auditory features are extracted using the OpenSMILE toolkit. We evaluated performance of four different automatic classification algorithms: support vector machine (SVM), multilayer perceptron (MLP), convolutional neural network (CNN), and long shortterm memory (LSTM). The results reveal that TRILL vectors trained on CNN provide the highest accuracy (81.86%).
引用
收藏
页码:133 / 137
页数:5
相关论文
共 50 条
  • [31] Expected value of user sessions:: Limitations to the non-semantic approach
    Menasalvas, E
    Pardo, B
    Millan, S
    Hochsztain, E
    Peña, JM
    IEEE/WIC INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2003, : 562 - 565
  • [32] Neural network modelling of word production in Finnish: coding semantic and non-semantic features
    Jarvelin, A
    Juhola, M
    Laine, M
    NEURAL COMPUTING & APPLICATIONS, 2006, 15 (02): : 91 - 104
  • [33] Detection of Mild Cognitive Impairment From Non-Semantic, Acoustic Voice Features: The Framingham Heart Study
    Ding, Huitong
    Lister, Adrian
    Karjadi, Cody
    Au, Rhoda
    Lin, Honghuang
    Bischoff, Brian
    Hwang, Phillip H.
    JMIR AGING, 2024, 7
  • [34] Compiling a functional array language with non-semantic memory information
    Munksgaard, Philip
    Oancea, Cosmin
    Henriksen, Troels
    PROCEEDINGS OF THE 2022 34TH SYMPOSIUM ON IMPLEMENTATION AND APPLICATION OF FUNCTIONAL LANGUAGES, IFL 2022, 2022,
  • [35] Non-Semantic Evaluation of Image Forensics Tools: Methodology and Database
    Bammey, Quentin
    Nikoukhah, Tina
    Gardella, Marina
    von Gioi, Rafael Grompone
    Colom, Miguel
    Morel, Jean-Michel
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2383 - 2392
  • [36] EFFECTS OF TASK INSTRUCTIONS AND PASSAGE DIFFICULTY ON SEMANTIC AND NON-SEMANTIC PROCESSING OF LINGUISTIC IDEAS
    PIEDMONT, RL
    KAYSON, WA
    PSYCHOLOGICAL REPORTS, 1979, 45 (01) : 83 - 88
  • [37] Accuracy of Recognition of Non-Semantic Expression of Emotion: Norms for a Forensic Population
    Mangiameli, L. J.
    Greenhill, T. S.
    ARCHIVES OF CLINICAL NEUROPSYCHOLOGY, 2009, 24 (05) : 515 - 515
  • [38] Neural network modelling of word production in Finnish: coding semantic and non-semantic features
    Antti Järvelin
    Martti Juhola
    Matti Laine
    Neural Computing & Applications, 2006, 15 : 91 - 104
  • [39] Reading aloud: Dissociating the semantic pathway from the non-semantic pathway of the lexical route
    Francesca Peressotti
    Remo Job
    Reading and Writing, 2003, 16 (3) : 179 - 194
  • [40] ROLE OF NON-SEMANTIC INFORMATION IN MEMORY - ORTHOGRAPHIC DISTINCTIVENESS EFFECTS ON RETENTION
    HUNT, RR
    ELLIOTT, JM
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 1980, 109 (01) : 49 - 74