Multivariate, Multi-frequency and Multimodal: Rethinking Graph Neural Networks for Emotion Recognition in Conversation

被引:14
|
作者
Chen, Feiyu [1 ,2 ]
Shao, Jie [1 ,2 ]
Zhu, Shuyuan [1 ]
Shen, Heng Tao [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Sichuan Artificial Intelligence Res Inst, Yibin, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01036
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Complex relationships of high arity across modality and context dimensions is a critical challenge in the Emotion Recognition in Conversation (ERC) task. Yet, previous works tend to encode multimodal and contextual relationships in a loosely-coupled manner, which may harm relationship modelling. Recently, Graph Neural Networks (GNN) which show advantages in capturing data relations, offer a new solution for ERC. However, existing GNN-based ERC models fail to address some general limits of GNNs, including assuming pairwise formulation and erasing high-frequency signals, which may be trivial for many applications but crucial for the ERC task. In this paper, we propose a GNN-based model that explores multivariate relationships and captures the varying importance of emotion discrepancy and commonality by valuing multi-frequency signals. We empower GNNs to better capture the inherent relationships among utterances and deliver more sufficient multimodal and contextual modelling. Experimental results show that our proposed method outperforms previous state-of-the-art works on two popular multimodal ERC datasets.
引用
收藏
页码:10761 / 10770
页数:10
相关论文
共 50 条
  • [31] SERC-GCN: SPEECH EMOTION RECOGNITION IN CONVERSATION USING GRAPH CONVOLUTIONAL NETWORKS
    Chandola, Deeksha
    Altarawneh, Enas
    Jenkin, Michael
    Papagelis, Manos
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 76 - 80
  • [32] Bayesian Graph Neural Networks for EEG-Based Emotion Recognition
    Chen, Jianhui
    Qian, Hui
    Gong, Xiaoliang
    CLINICAL IMAGE-BASED PROCEDURES, DISTRIBUTED AND COLLABORATIVE LEARNING, ARTIFICIAL INTELLIGENCE FOR COMBATING COVID-19 AND SECURE AND PRIVACY-PRESERVING MACHINE LEARNING, CLIP 2021, DCL 2021, LL-COVID19 2021, PPML 2021, 2021, 12969 : 24 - 33
  • [33] EEG Emotion Recognition Using Dynamical Graph Convolutional Neural Networks
    Song, Tengfei
    Zheng, Wenming
    Song, Peng
    Cui, Zhen
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2020, 11 (03) : 532 - 541
  • [34] Dual-level constraint based distributed graph convolution network for multimodal emotion recognition in conversation
    Xiang, Yan
    Wang, Lu
    Tan, Xiaocong
    Guo, Junjun
    NEUROCOMPUTING, 2025, 618
  • [35] Hierarchically stacked graph convolution for emotion recognition in conversation
    Wang, Binqiang
    Dong, Gang
    Zhao, Yaqian
    Li, Rengang
    Cao, Qichun
    Hu, Kekun
    Jiang, Dongdong
    KNOWLEDGE-BASED SYSTEMS, 2023, 263
  • [36] Rethinking pooling in graph neural networks
    Mesquita, Diego
    Souza, Amauri H.
    Kaski, Samuel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [37] Multimodal Continuous Emotion Recognition with Data Augmentation Using Recurrent Neural Networks
    Huang, Jian
    Li, Ya
    Tao, Jianhua
    Lian, Zheng
    Niu, Mingyue
    Yang, Minghao
    PROCEEDINGS OF THE 2018 AUDIO/VISUAL EMOTION CHALLENGE AND WORKSHOP (AVEC'18), 2018, : 57 - 64
  • [38] Multimodal Facial Emotion Recognition Using Improved Convolution Neural Networks Model
    Udeh, Chinonso Paschal
    Chen, Luefeng
    Du, Sheng
    Li, Min
    Wu, Min
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (04) : 710 - 719
  • [39] HiMul-LGG: A hierarchical decision fusion-based local-global graph neural network for multimodal emotion recognition in conversation
    Fu, Changzeng
    Qian, Fengkui
    Su, Kaifeng
    Su, Yikai
    Wang, Ze
    Shi, Jiaqi
    Liu, Zhigang
    Liu, Chaoran
    Ishi, Carlos Toshinori
    NEURAL NETWORKS, 2025, 181
  • [40] End-to-End Multimodal Emotion Recognition Using Deep Neural Networks
    Tzirakis, Panagiotis
    Trigeorgis, George
    Nicolaou, Mihalis A.
    Schuller, Bjorn W.
    Zafeiriou, Stefanos
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (08) : 1301 - 1309