Multivariate, Multi-frequency and Multimodal: Rethinking Graph Neural Networks for Emotion Recognition in Conversation

Cited by: 14
Authors
Chen, Feiyu [1 ,2 ]
Shao, Jie [1 ,2 ]
Zhu, Shuyuan [1 ]
Shen, Heng Tao [1 ,2 ]
Affiliations
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Sichuan Artificial Intelligence Res Inst, Yibin, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
DOI
10.1109/CVPR52729.2023.01036
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Complex relationships of high arity across modality and context dimensions are a critical challenge in the Emotion Recognition in Conversation (ERC) task. Yet previous works tend to encode multimodal and contextual relationships in a loosely coupled manner, which may harm relationship modelling. Recently, Graph Neural Networks (GNNs), which show advantages in capturing data relations, have offered a new solution for ERC. However, existing GNN-based ERC models fail to address some general limits of GNNs, including assuming a pairwise formulation and erasing high-frequency signals, which may be trivial for many applications but are crucial for the ERC task. In this paper, we propose a GNN-based model that explores multivariate relationships and captures the varying importance of emotion discrepancy and commonality by valuing multi-frequency signals. We empower GNNs to better capture the inherent relationships among utterances and deliver more sufficient multimodal and contextual modelling. Experimental results show that our proposed method outperforms previous state-of-the-art works on two popular multimodal ERC datasets.
Pages: 10761 - 10770
Number of pages: 10
Related papers
50 in total
  • [1] Multimodal Conversation Emotion Recognition Combining Multi-Level Attention and Multi-Stream Graph Neural Networks
    Feng, Hongqi
    Guo, Yongxiang
    Zhang, Denghui
    Yang, Xinli
    Computer Engineering and Applications, 2024, 60 (21) : 154 - 163
  • [2] FrameERC: Framelet Transform Based Multimodal Graph Neural Networks for Emotion Recognition in Conversation
    Li, Ming
    Shi, Jiandong
    Bai, Lu
    Huang, Changqin
    Jiang, Yunliang
    Lu, Ke
    Wang, Shijin
    Hancock, Edwin R.
    PATTERN RECOGNITION, 2025, 161
  • [3] Multimodal Decoupled Distillation Graph Neural Network for Emotion Recognition in Conversation
    Dai, Yijing
    Li, Yingjian
    Chen, Dongpeng
    Li, Jinxing
    Lu, Guangming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9910 - 9924
  • [4] Multimodal Emotion Recognition Using Compressed Graph Neural Networks
    Durkic, Tijana
    Simic, Nikola
    Bajovic, Sinisa Suzie Dragana
    Peric, Zoran
    Delic, Vladan
    SPEECH AND COMPUTER, SPECOM 2024, PT II, 2025, 15300 : 109 - 121
  • [5] Multimodal Emotion Recognition Method Based on Domain Generalization and Graph Neural Networks
    Xie, Jinbao
    Wang, Yulong
    Meng, Tianxin
    Tai, Jianqiao
    Zheng, Yueqian
    Varatnitski, Yury I.
    ELECTRONICS, 2025, 14 (05):
  • [6] Multimodal graph learning with framelet-based stochastic configuration networks for emotion recognition in conversation
    Shi, Jiandong
    Li, Ming
    Chen, Yuting
    Cui, Lixin
    Bai, Lu
    INFORMATION SCIENCES, 2025, 686
  • [7] Masked Graph Learning With Recurrent Alignment for Multimodal Emotion Recognition in Conversation
    Meng, Tao
    Zhang, Fuchen
    Shou, Yuntao
    Shao, Hongen
    Ai, Wei
    Li, Keqin
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4298 - 4312
  • [8] Hierarchical heterogeneous graph network based multimodal emotion recognition in conversation
    Peng, Junyin
    Tang, Hong
    Zheng, Wenbin
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [9] MMDAG: Multimodal Directed Acyclic Graph Network for Emotion Recognition in Conversation
    Xu, Shuo
    Jia, Yuxiang
    Niu, Changyong
    Zan, Hongying
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6802 - 6807
  • [10] DialogueGCN: A Graph Convolutional Neural Network for Emotion Recognition in Conversation
    Ghosal, Deepanway
    Majumder, Navonil
    Poria, Soujanya
    Chhaya, Niyati
    Gelbukh, Alexander
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 154 - 164