Attention to Emotions: Body Emotion Recognition In-the-Wild Using Self-attention Transformer Network

被引：0

作者：

Paiva, Pedro V. V. ^{[1
,3
]}

Ramos, Josue J. G. ^{[2
]}

Gavrilova, Marina ^{[3
]}

Carvalho, Marco A. G. ^{[1
]}

机构：

[1] Univ Estadual Campinas, Sch Technol, Limeira, Brazil

[2] Renato Archer IT Ctr, Cyber Phys Syst Div, Campinas, Brazil

[3] Univ Calgary, Dept Comp Sci, Calgary, AB, Canada

来源：

COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VISIGRAPP 2023 | 2024年 / 2103卷

基金：

巴西圣保罗研究基金会; 加拿大自然科学与工程研究理事会;

关键词：

Body emotion recognition; Affective computing; Video and image processing; Gait analysis; Attention-based design; GRAPH CONVOLUTIONAL NETWORKS;

D O I：

10.1007/978-3-031-66743-5_10

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Body movements are an essential part of non-verbal communication as they help to express and interpret human emotions. The potential of Body Emotion Recognition (BER) is immense, as it can provide insights into user preferences, automate real-time exchanges and enable machines to respond to human emotions. BER finds applications in customer service, healthcare, entertainment, emotion-aware robots, and other areas. While face expression-based techniques are extensively researched, detecting emotions from body movements in the realworld presents several challenges, including variations in body posture, occlusions, and background. Recent research has established the efficacy of transformer deep-learning models beyond the language domain to solve video and image-related problems. A key component of transformers is the self-attention mechanism, which captures relationships among features across different spatial locations, allowing contextual information extraction. In this study, we aim to understand the role of body movements in emotion expression and to explore the use of transformer networks for body emotion recognition. Our method proposes a novel linear projection function of the visual transformer, which enables the transformation of 2D joint coordinates into a conventional matrix representation. Using an original method of contextual information learning, the developed approach enables a more accurate recognition of emotions by establishing unique correlations between individual's body motions over time. Our results demonstrated that the self-attention mechanism was able to achieve high accuracy in predicting emotions from body movements, surpassing the performance of other recent deep-learning methods. In addition, the impact of dataset size and frame rate on classification performance is analyzed.

引用

页码：206 / 228

页数：23

共 50 条

[41] Spectral Superresolution Using Transformer with Convolutional Spectral Self-Attention
Liao, Xiaomei
He, Lirong
Mao, Jiayou
Xu, Meng
REMOTE SENSING, 2024, 16 (10)
[42] CNN-TRANSFORMER WITH SELF-ATTENTION NETWORK FOR SOUND EVENT DETECTION
Wakayama, Keigo
Saito, Shoichiro
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 806 - 810
[43] EEG-Based Emotion Recognition Using Convolutional Recurrent Neural Network with Multi-Head Self-Attention
Hu, Zhangfang
Chen, Libujie
Luo, Yuan
Zhou, Jingfan
APPLIED SCIENCES-BASEL, 2022, 12 (21):
[44] Universal Graph Transformer Self-Attention Networks
Dai Quoc Nguyen
Tu Dinh Nguyen
Dinh Phung
COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 193 - 196
[45] Sparse self-attention transformer for image inpainting
Huang, Wenli
Deng, Ye
Hui, Siqi
Wu, Yang
Zhou, Sanping
Wang, Jinjun
PATTERN RECOGNITION, 2024, 145
[46] SST: self-attention transformer for infrared deconvolution
Gao, Lei
Yan, Xiaohong
Deng, Lizhen
Xu, Guoxia
Zhu, Hu
INFRARED PHYSICS & TECHNOLOGY, 2024, 140
[47] Att-Net: Enhanced emotion recognition system using lightweight self-attention module
Mustaqeem
Kwon, Soonil
APPLIED SOFT COMPUTING, 2021, 102
[48] Lite Vision Transformer with Enhanced Self-Attention
Yang, Chenglin
Wang, Yilin
Zhang, Jianming
Zhang, He
Wei, Zijun
Lin, Zhe
Yuille, Alan
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11988 - 11998
[49] Synthesizer: Rethinking Self-Attention for Transformer Models
Tay, Yi
Bahri, Dara
Metzler, Donald
Juan, Da-Cheng
Zhao, Zhe
Zheng, Che
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7192 - 7203
[50] Self-attention Bi-RNN for developer emotion recognition based on EEG
Wang, Yingdong
Zheng, Yuhui
Cao, Lu
Zhang, Zhiling
Ruan, Qunsehng
Wu, Qingfeng
IET SOFTWARE, 2022,

← 1 2 3 4 5 →