A graph neural network with context filtering and feature correction for conversational emotion recognition

被引:6
|
作者
Gan, Chenquan [1 ,2 ]
Zheng, Jiahao [1 ]
Zhu, Qingyi [2 ]
Kumar, Deepak [3 ,4 ]
Struc, Vitomir [5 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Sch Cyber Secur & Informat Law, Chongqing 400065, Peoples R China
[3] Dalian Univ Technol, Sch Artificial Intelligence, Key Lab Intelligent Control & Optimizat Ind Equipm, Minist Educ, Dalian 116024, Peoples R China
[4] Symbiosis Int Univ, Symbiosis Inst Technol, Pune 412115, India
[5] Univ Ljubljana, Fac Elect Engn, Trzaska Cesta 25, SI-1000 Ljubljana, Slovenia
关键词
Conversational emotion recognition; Context filter; Feature correction; Graph network; SENTIMENT; MODEL;
D O I
10.1016/j.ins.2023.120017
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conversational emotion recognition represents an important machine-learning problem with a wide variety of deployment possibilities. The key challenge in this area is how to properly capture the key conversational aspects that facilitate reliable emotion recognition, including utterance semantics, temporal order, informative contextual cues, speaker interactions as well as other relevant factors. In this paper, we present a novel Graph Neural Network approach for conversational emotion recognition at the utterance level. Our method addresses the outlined challenges and represents conversations in the form of graph structures that naturally encode temporal order, speaker dependencies, and even long-distance context. To efficiently capture the semantic content of the conversations, we leverage the zero-shot feature-extraction capabilities of pre-trained large-scale language models and then integrate two key contributions into the graph neural network to ensure competitive recognition results. The first is a novel context filter that establishes meaningful utterance dependencies for the graph construction procedure and removes low-relevance and uninformative utterances from being used as a source of contextual information for the recognition task. The second contribution is a feature-correction procedure that adjusts the information content in the generated feature representations through a gating mechanism to improve their discriminative power and reduce emotion-prediction errors. We conduct extensive experiments on four commonly used conversational datasets, i.e., IEMOCAP, MELD, Dailydialog, and EmoryNLP, to demonstrate the capabilities of the developed graph neural network with context filtering and error-correction capabilities. The results of the experiments point to highly promising performance, especially when compared to state-of-the-art competitors from the literature.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Multimodal Decoupled Distillation Graph Neural Network for Emotion Recognition in Conversation
    Dai, Yijing
    Li, Yingjian
    Chen, Dongpeng
    Li, Jinxing
    Lu, Guangming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9910 - 9924
  • [22] Conversational Emotion Recognition Incorporating Knowledge Graph and Curriculum Learning
    Du J.
    Sun Y.
    Lin H.
    Yang L.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (05): : 1299 - 1309
  • [23] Speech Emotion Recognition with Heterogeneous Feature Unification of Deep Neural Network
    Jiang, Wei
    Wang, Zheng
    Jin, Jesse S.
    Han, Xianfeng
    Li, Chunguang
    SENSORS, 2019, 19 (12)
  • [24] Graph Convolutional Neural Network Based Emotion Recognition with Brain Functional Connectivity Network
    Gao, Pengzhi
    Zheng, Xiangwei
    Wang, Tao
    Zhang, Yuang
    International Journal of Crowd Science, 2024, 8 (04) : 195 - 204
  • [25] Towards Contrastive Context-Aware Conversational Emotion Recognition
    Zhang, Hanqing
    Song, Dawei
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (04) : 1879 - 1891
  • [26] GraphCFC: A Directed Graph Based Cross-Modal Feature Complementation Approach for Multimodal Conversational Emotion Recognition
    Li, Jiang
    Wang, Xiaoping
    Lv, Guoqing
    Zeng, Zhigang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 77 - 89
  • [27] Edge adjacency graph and neural network architecture for machining feature recognition
    Li, Yang
    Li, Eugene
    Lenover, Michael
    Mann, Stephen
    Bedi, Sanjeev
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2025, 136 (02): : 897 - 908
  • [28] LR-GCN: Latent Relation-Aware Graph Convolutional Network for Conversational Emotion Recognition
    Ren, Minjie
    Huang, Xiangdong
    Li, Wenhui
    Song, Dan
    Nie, Weizhi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4422 - 4432
  • [29] EEG emotion recognition using improved graph neural network with channel selection
    Lin, Xuefen
    Chen, Jielin
    Ma, Weifeng
    Tang, Wei
    Wang, Yuchen
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 231
  • [30] Context- and Knowledge-Aware Graph Convolutional Network for Multimodal Emotion Recognition
    Fu, Yahui
    Okada, Shogo
    Wang, Longbiao
    Guo, Lili
    Song, Yaodong
    Liu, Jiaxing
    Dang, Jianwu
    IEEE MULTIMEDIA, 2022, 29 (03) : 91 - 99