Hierarchically stacked graph convolution for emotion recognition in conversation

被引:17
|
作者
Wang, Binqiang [1 ,2 ]
Dong, Gang [2 ]
Zhao, Yaqian [2 ]
Li, Rengang [2 ]
Cao, Qichun [2 ]
Hu, Kekun [2 ]
Jiang, Dongdong [2 ]
机构
[1] Shandong Mass Informat Technol Res Inst, Jinan, Peoples R China
[2] Inspur Beijing Elect Informat Ind Co Ltd, Beijing, Peoples R China
关键词
Emotion recognition; Graph neural network; Residual connection; Transformer;
D O I
10.1016/j.knosys.2023.110285
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate emotion recognition can drive the robot to understand human affection intentions precisely and deliver the emotional response when communicating with a person. Recently, graph structure has been applied to explicitly capture the self and inter-dependencies of speakers in the conversa-tion. However, the performance of the method is limited by inadequate discriminative information extraction based on naive graph convolution. In this paper, we propose a novel Hierarchically Stacked Graph Convolution Framework (HSGCF), which leverages hierarchical structure to extract emotional discriminative features. The proposed HSGCF uses five graph convolution layers connected hierarchically to establish a more discriminative emotional feature extractor. More importantly, to mitigate the over-smooth problem caused by deeper networks, Transformer structures with residual connection are introduced into HSGCF. Experimental results on the IEMOCAP benchmark dataset indicate the proposed framework achieves a 4.12% improvement in accuracy and a 4.80% improvement in F1 score compared with the baseline method. (c) 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Dual-level constraint based distributed graph convolution network for multimodal emotion recognition in conversation
    Xiang, Yan
    Wang, Lu
    Tan, Xiaocong
    Guo, Junjun
    NEUROCOMPUTING, 2025, 618
  • [2] Progressive graph convolution network for EEG emotion recognition
    Zhou, Yijin
    Li, Fu
    Li, Yang
    Ji, Youshuo
    Shi, Guangming
    Zheng, Wenming
    Zhang, Lijian
    Chen, Yuanfang
    Cheng, Rui
    NEUROCOMPUTING, 2023, 544
  • [3] I-GCN: Incremental Graph Convolution Network for Conversation Emotion Detection
    Nie, Weizhi
    Chang, Rihao
    Ren, Minjie
    Su, Yuting
    Liu, Anan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4471 - 4481
  • [4] DialogueGCN: A Graph Convolutional Neural Network for Emotion Recognition in Conversation
    Ghosal, Deepanway
    Majumder, Navonil
    Poria, Soujanya
    Chhaya, Niyati
    Gelbukh, Alexander
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 154 - 164
  • [5] A Multi-Dimensional Graph Convolution Network for EEG Emotion Recognition
    Du, Guanglong
    Su, Jinshao
    Zhang, Linlin
    Su, Kang
    Wang, Xueqian
    Teng, Shaohua
    Liu, Peter Xiaoping
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [6] Emotion Recognition in Conversation Based on a Dynamic Complementary Graph Convolutional Network
    Yang, Zhenyu
    Li, Xiaoyang
    Cheng, Yuhu
    Zhang, Tong
    Wang, Xuesong
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (03) : 1567 - 1579
  • [7] Masked Graph Learning With Recurrent Alignment for Multimodal Emotion Recognition in Conversation
    Meng, Tao
    Zhang, Fuchen
    Shou, Yuntao
    Shao, Hongen
    Ai, Wei
    Li, Keqin
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4298 - 4312
  • [8] Hierarchical heterogeneous graph network based multimodal emotion recognition in conversation
    Peng, Junyin
    Tang, Hong
    Zheng, Wenbin
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [9] Multimodal Decoupled Distillation Graph Neural Network for Emotion Recognition in Conversation
    Dai, Yijing
    Li, Yingjian
    Chen, Dongpeng
    Li, Jinxing
    Lu, Guangming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9910 - 9924
  • [10] MMDAG: Multimodal Directed Acyclic Graph Network for Emotion Recognition in Conversation
    Xu, Shuo
    Jia, Yuxiang
    Niu, Changyong
    Zan, Hongying
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6802 - 6807