Fusion with Hierarchical Graphs for Multimodal Emotion Recognition

Cited by: 0
Authors
Tang, Shuyun [1]
Luo, Zhaojie [2]
Nan, Guoshun [4]
Baba, Jun [3]
Yoshikawa, Yuichiro [2]
Ishiguro, Hiroshi [2]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA USA
[2] Osaka Univ, Osaka, Japan
[3] CyberAgent Inc, Tokyo, Japan
[4] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
Keywords
DEEP
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Automatic emotion recognition (AER) based on enriched multimodal inputs, including text, speech, and visual cues, is crucial in the development of emotionally intelligent machines. Although modeling complex modality relationships has proven effective for AER, such relationships remain largely underexplored because previous works predominantly relied on fusion mechanisms over simply concatenated features to learn multimodal representations for emotion classification. This paper proposes a novel hierarchical fusion graph convolutional network (HFGCN) model that learns more informative multimodal representations by considering the modality dependencies during the feature fusion procedure. Specifically, the proposed model fuses multimodal inputs using a two-stage graph construction approach and encodes the modality dependencies into the conversation representation. We verified the interpretability of the proposed method by projecting the emotional states onto a 2D valence-arousal (VA) subspace. Extensive experiments showed that the proposed model yields more accurate AER, achieving state-of-the-art results on two public datasets, IEMOCAP and MELD.
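The abstract gives no architectural detail beyond "two-stage graph construction", so the following is only a rough illustration of the general idea and not the HFGCN model itself. It assumes (hypothetically) that stage one connects the modality nodes of each utterance in a fully connected graph and mean-pools them into one utterance vector, and that stage two links neighbouring utterances in a chain graph. The function names, graph topologies, and pooling choice are all assumptions made for this sketch; the propagation rule is the standard symmetric-normalized graph convolution.

```python
import math


def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2} for a square adjacency matrix."""
    n = len(adj)
    # Add self-loops so every node keeps part of its own features.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]


def matmul(a, b):
    """Plain-Python matrix product of two list-of-lists matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def gcn_layer(adj, feats, weight):
    """One graph-convolution step: ReLU(A_norm @ H @ W)."""
    out = matmul(matmul(normalize_adjacency(adj), feats), weight)
    return [[max(0.0, v) for v in row] for row in out]


def mean_pool(rows):
    """Average node features column-wise into a single vector."""
    return [sum(col) / len(rows) for col in zip(*rows)]


def two_stage_fusion(utterances, w_modality, w_conversation):
    """Hypothetical two-stage fusion; NOT the paper's actual design.

    utterances: list of utterances, each a list of modality feature vectors
                (e.g. text, speech, visual) of equal dimension.
    """
    # Stage 1 (assumed): fully connected graph over each utterance's
    # modality nodes, pooled into one fused utterance vector.
    fused = []
    for nodes in utterances:
        k = len(nodes)
        adj = [[0.0 if i == j else 1.0 for j in range(k)] for i in range(k)]
        fused.append(mean_pool(gcn_layer(adj, nodes, w_modality)))

    # Stage 2 (assumed): chain graph linking each utterance to its
    # immediate neighbours in the conversation.
    t = len(fused)
    adj = [[1.0 if abs(i - j) == 1 else 0.0 for j in range(t)]
           for i in range(t)]
    return gcn_layer(adj, fused, w_conversation)
```

Calling `two_stage_fusion` with two utterances of three modality vectors each and identity weight matrices returns one fused vector per utterance. A real model would learn the weights and would likely use richer edge definitions than the fixed graphs assumed here.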
Pages: 1288-1296
Number of pages: 9