Multi-Attention Fusion Network for Video-based Emotion Recognition

Cited by: 23
Authors
Wang, Yanan [1]
Wu, Jianming [1]
Hoashi, Keiichiro [1]
Affiliations
[1] KDDI Res Inc, Saitama, Japan
Keywords
Emotion recognition; Multimodal; Attention mechanism; Multimodal domain adaptation; Fusion network;
DOI
10.1145/3340555.3355720
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Humans routinely attend to important emotional information in the visual and audio modalities without considering multimodal alignment issues, and recognize emotions by integrating important multimodal information over a certain interval. In this paper, we propose a multiple attention fusion network (MAFN) that aims to improve emotion recognition performance by modeling human emotion recognition mechanisms. MAFN consists of two types of attention mechanisms: the intra-modality attention mechanism dynamically extracts representative emotion features from the frame sequence of a single modality; the inter-modality attention mechanism automatically highlights specific modal features based on their importance. In addition, we define a multimodal domain adaptation method that helps capture interactions between modalities. MAFN achieved 58.65% recognition accuracy on the AFEW test set, an improvement of 17.58 percentage points over the baseline of 41.07%.
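The two attention stages named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' MAFN implementation: the dot-product scoring, the feature dimension, the random inputs, and every name here (`intra_modality_attention`, `inter_modality_attention`, `w_video`, `w_audio`, `v`) are assumptions chosen only to show the general idea of weighting frames within a modality and then weighting modalities against each other.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    x = x - np.max(x, axis=axis, keepdims=True)
    e = np.exp(x)
    return e / np.sum(e, axis=axis, keepdims=True)

def intra_modality_attention(frames, w):
    """Pool a frame sequence of one modality into a single vector.

    frames: (T, D) per-frame features; w: (D,) hypothetical scoring vector.
    Returns the attention-weighted feature (D,) and the frame weights (T,).
    """
    alpha = softmax(frames @ w)      # one weight per frame, sums to 1
    return alpha @ frames, alpha

def inter_modality_attention(features, v):
    """Fuse per-modality vectors by importance.

    features: (M, D) one vector per modality; v: (D,) hypothetical scoring vector.
    Returns the fused feature (D,) and the modality weights (M,).
    """
    beta = softmax(features @ v)     # one weight per modality, sums to 1
    return beta @ features, beta

# Illustrative random inputs (assumed shapes, not real AFEW features).
rng = np.random.default_rng(0)
T_video, T_audio, D = 16, 20, 8      # sequence lengths may differ per modality
video = rng.normal(size=(T_video, D))
audio = rng.normal(size=(T_audio, D))
w_video, w_audio, v = (rng.normal(size=D) for _ in range(3))

video_vec, alpha_v = intra_modality_attention(video, w_video)
audio_vec, alpha_a = intra_modality_attention(audio, w_audio)
fused, beta = inter_modality_attention(np.stack([video_vec, audio_vec]), v)
```

Because each modality is pooled separately before fusion, the video and audio sequences in this sketch do not need to be aligned or have equal length.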
Pages: 595-601
Page count: 7
Related papers
50 items in total
  • [41] MuAt-Va: Multi-Attention and Video-Auxiliary Network for Device-Free Action Recognition
    Sheng, Biyun
    Sun, Chaorun
    Xiao, Fu
    Gui, Linqing
    Guo, Zhengxin
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (12): 10870 - 10880
  • [42] Multi-attention network for pedestrian intention prediction based on spatio-temporal feature fusion
    Zhang, Xiaofei
    Wang, Xiaolan
    Zhang, Weiwei
    Wang, Yansong
    Liu, Xintian
    Wei, Dan
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2024, 238 (13) : 4202 - 4215
  • [43] FACIAL EXPRESSION RECOGNITION ALGORITHM BASED ON MULTI-ATTENTION MECHANISM
    Wu, Huixin
    Huang, Zehuan
    Jiang, Wei
    Zhao, Xin
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2023, 19 (04): 1239 - 1250
  • [44] Multi-Attention Network for Sentiment Analysis
    Du, Tingting
    Huang, Yunyin
    Wu, Xian
    Chang, Huiyou
    PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL (NLPIR 2018), 2018: 49 - 54
  • [45] Audio-Video Fusion with Double Attention for Multimodal Emotion Recognition
    Mocanu, Bogdan
    Tapu, Ruxandra
    2022 IEEE 14TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2022
  • [46] Video Emotion Recognition Based on Hierarchical Attention Model
    Wang X.
    Pan L.
    Peng M.
    Hu M.
    Jin C.
    Ren F.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (01): 27 - 35
  • [47] Two Stream Multi-Attention Graph Convolutional Network for Skeleton-Based Action Recognition
    Zhou, Huijian
    Tian, Zhiqiang
    Du, Shaoyi
    ARTIFICIAL INTELLIGENCE AND ROBOTICS, ISAIR 2023, 2024, 1998 : 112 - 120
  • [48] ATTENTION DRIVEN FUSION FOR MULTI-MODAL EMOTION RECOGNITION
    Priyasad, Darshana
    Fernando, Tharindu
    Denman, Simon
    Sridharan, Sridha
    Fookes, Clinton
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 3227 - 3231
  • [49] Multi-Defect Detection Network for High-Voltage Insulators Based on Adaptive Multi-Attention Fusion
    Hu, Yiming
    Wen, Bin
    Ye, Yongsheng
    Yang, Chao
    APPLIED SCIENCES-BASEL, 2023, 13 (24)
  • [50] Multi-Attention Network for Stereo Matching
    Yang, Xiaowei
    He, Lin
    Zhao, Yong
    Sang, Haiwei
    Yang, Zuliu
    Cheng, Xianjing
    IEEE ACCESS, 2020, 8 : 113371 - 113382