Self-Relation Attention and Temporal Awareness for Emotion Recognition via Vocal Burst

Cited by: 2
Authors
Trinh, Dang-Linh [1]
Vo, Minh-Cong [1]
Kim, Soo-Hyung [1]
Yang, Hyung-Jeong [1]
Lee, Guee-Sang [1]
Affiliations
[1] Chonnam Natl Univ, Dept Artificial Intelligence Convergence, 77 Yongbong Ro, Gwangju 500757, South Korea
Funding
National Research Foundation, Singapore;
Keywords
vocal burst; self-supervised model; self-relation attention; temporal awareness; SPEECH; VOICE;
DOI
10.3390/s23010200
Chinese Library Classification number
O65 [Analytical Chemistry];
Discipline classification code
070302 ; 081704 ;
Abstract
Speech emotion recognition (SER) has attracted considerable research attention in recent years. Although much work has been done on this topic, research on emotion recognition from non-verbal speech (known as the vocal burst) remains sparse. A vocal burst is short and carries no linguistic content, which makes it harder to handle than verbal speech. In this paper, we therefore propose a self-relation attention and temporal awareness (SRA-TA) module for vocal bursts, which captures long-term dependencies and focuses on the salient parts of the audio signal. Our proposed method contains three main stages. First, latent features are extracted from the raw audio signal and its Mel-spectrogram using a self-supervised learning model. Second, the SRA-TA module captures the valuable information in these latent features. Finally, all features are concatenated and fed into ten individual fully connected layers to predict the scores of 10 emotions. Our method achieves a mean concordance correlation coefficient (CCC) of 0.7295 on the test set, ranking first in the high-dimensional emotion task of the 2022 ACII Affective Vocal Burst Workshop & Challenge.
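The abstract reports a mean concordance correlation coefficient (CCC) over 10 emotion scores. As a rough illustration of how such a metric is commonly computed, the sketch below uses the standard CCC definition, 2*cov(x, y) / (var(x) + var(y) + (mean(x) - mean(y))^2), with population (divide-by-n) statistics. The function names `ccc` and `mean_ccc` and the per-emotion list layout are assumptions made for this example; they are not taken from the paper or the challenge code, whose exact evaluation details may differ.

```python
from typing import Sequence


def ccc(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Concordance correlation coefficient between two equal-length sequences.

    Uses population (divide-by-n) variance and covariance. This is an
    illustrative sketch, not the challenge's official scoring code.
    """
    n = len(y_true)
    if n == 0 or n != len(y_pred):
        raise ValueError("inputs must be non-empty and of equal length")

    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    var_t = sum((t - mean_t) ** 2 for t in y_true) / n
    var_p = sum((p - mean_p) ** 2 for p in y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred)) / n

    denom = var_t + var_p + (mean_t - mean_p) ** 2
    if denom == 0:
        # Both sequences are constant and identical: CCC is undefined here.
        raise ValueError("CCC is undefined for identical constant sequences")
    return 2.0 * cov / denom


def mean_ccc(
    true_by_emotion: Sequence[Sequence[float]],
    pred_by_emotion: Sequence[Sequence[float]],
) -> float:
    """Average the per-emotion CCC; each inner sequence holds one emotion's scores."""
    scores = [ccc(t, p) for t, p in zip(true_by_emotion, pred_by_emotion)]
    return sum(scores) / len(scores)
```

Under this definition a prediction that matches the labels exactly scores 1, while a prediction with the right shape but a constant offset is penalized through the squared mean-difference term in the denominator.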
Pages: 13